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A MODEL FOR THE DETERIORATION IN STRENGTH OF 
MATERIALS DUE TO FUNGAL ATTACK 


J. S. Maritz 


Transvaal and Orange Free State Chamber of Mines Research Laboratories, 
Johannesburg, South Africa 


INTRODUCTION 


Many materials used by industries are subject to fungal attack 
and the conditions to which they are subjected are frequently favourable 
to the growth of fungi. The result of fungal attack is usually that the 
material gradually loses its strength and eventually 4lisintegrates; but 
it may actually become useless long before such an advanced stage 
is reached. On the other hand, due to slow decrease in strength the 
material may serve its purpose for a long time even though it is being 
attacked by fungi. Laboratory studies of the rate of decrease in strength 
under different conditions are, therefore, of considerable importance. 

Development of a mathematical model for the loss of strength due 
to fungal attack seems desirable for at least two reasons. They are: 


(i) the results of the laboratory experiments are usually observations 
on a number of random variables and a mathematical model 
provides a description and summary of a set of data which can 
be very useful, particularly when different sets are to be com- 
pared with each other. 

(ii) as many of these investigations are rather lengthy, a sound 
model can be of great use in the efficient planning of experiments. 


The basic assumptions on which the model in this paper rests are: 
if a specimen of material is exposed (to fungal attack), a certain period 
of time will elapse before the attack actually starts and, once it starts, 
the strength decreases gradually to zero. Both the time of ‘survival’ 
and the rate at which the strength decreases may be random variables. 

A specific example, in which connection this approach has been 
successful, is encountered on the South African gold and coal mines 
where various fabrics are used underground for dust filtering and other 
purposes. An example of a laboratory experiment with Cotton Duck 
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in which tensile strengths of pieces of standard shape and size were 
measured is given in a later section. 


MATHEMATICAL MODEL: GENERAL FORM 


Suppose that a number of sample pieces from the same batch of a 
material are exposed to fungal attack under conditions as nearly identical 
as possible. The attack on individual samples will actually start after 
time intervals ¢, , t, , --- , where ¢ = 0 corresponds to the start of the 
experiment. Each ¢; may be regarded as a realization of a random 
variable ¢ with probability density g(t)dt, so that the expected propor- 
tion of samples remaining with no attack after time ¢ is 


1- f 


The initial (i.e. unexposed) strength will vary from one sample to 
another, but, for the moment, we consider only samples with the same 
initial strength z,. Take one on which attack starts at time ¢, , ice. 
during the interval ¢; + 36¢, . At t, (t, > ¢,), due to the attack the 
strength has decreased to x,f(t, — ¢,). This function f(t, — é) must 
satisfy two conditions: 


(a) =1 for 
(b) f(t, 4) as ft, increases. 


The form of f may itself be a random variable, but we make the simplify- 

ing assumption that it remains fixed and that its value for fixed (¢, — t) 

is a random variable only by virtue of its dependence on a constant 

a which is a random variable. This means that the samples on which 

attack starts at ¢, will at ¢, have strengths xof(t, — t, , a,), %of(te — , a2) 
- with expected mean value 


E,[tof(te — 4 = — , 


We are now able to calculate the expected mean value at time t 
of all samples with initial strength x, on the assumption that the random 
variables x and ¢ are independent: the total proportion of attacked 
samples after time ¢ is G(t). The proportion of these attacked during 
the interval 7 + 367 is g(r)6r/G(t), and the mean strength of these at 
time t(> 7) is 72, [f(t — 7, a)]. Therefore the mean of all attacked 
samples at time ¢ is 
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and it follows that the overall mean at ¢ is 


(1) 


Assuming that the random variables a and x are independent, we find 
that the expected mean strength at time ¢ is 


— 1, dr + — G00). 


t 
0 
where (x) is the expected mean strength of unexposed samples. 
MATHEMATICAL MODEL: SPECIAL FORMS 


The special forms which the functions g(t) and E,[f(t2 — t, , a)] 
may assume are, of course, numerous and we will give only two special 
forms of £(y,) resulting from what appear to be reasonable assumptions 
about the two functions. 

The function g(t). The distribution of ¢ must be continuous starting 
at ¢ = 0 and covering the range 0 — ~. A simple and plausible choice 
for g(t) is 

g(t) = (3) 


This distribution has only one parameter, 8, with the ‘physical’ meaning 
that the mean time of survival before attack starts is 1/8. We were 
influenced in this choice of g(t) by the knowledge that this type of 
distribution has been found to fit a large variety of ‘life’ distributions, 
e.g., life of electron tubes [Epstein 1954]. 

The function E,[f(t. — t, , a)]. In considering this function we 
must really take account of two functions, i.e. f and the probability 
density p(a) of the random variable a. We have examined various 
forms of f and found 


(4) 

one of the most easily handled and also practically successful. 
The probability distribution of a. Unless fairly simple forms for 
p(a) are used in conjunction with (4), the resulting expression for 
E.\f(t. — t, , a)] ean be very awkward indeed. Following are two 


examples of special forms of E(y,) obtained by using (3), (4), and two 
very simple forms of p(a): 


Example (7) 
We note that, substituting (3) and (4) in (2) gives the result 


-at 


© 
Ey.) = E(2) p(a) da + (5) 
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If we put 
p(a) = 6(a — a), (6) 


the Dirac delta function, i.e. a assumes the value a with probability 
1, (5) becomes 


Ely.) = — (7) 


This is a simple expression which may readily be ‘fitted’ to observed 
data, and we have found it to be applicable to many of our laboratory 
experimental results. An example will be given in a later section. 


Example (ii)* 


If we assume, instead of (6), that the distribution of a is located 
at a and has a small variance o”, expression (2) may be written approxi- 
mately as 


By) Be) [ 0) — ar 


(8) 
+ E(a)[1 — Gd], 
and substituting (3) and (4) we have 
= E(x) —Bt -at 


This is a more flexible expression than (7), as can be seen more 
clearly if we consider the special case 8 = a which gives for (7) and 
(9) respectively, 


E(y,) = E(xje~*"(1 + at) 
and 


2 
E(y,) = + at + 
As the form (7) appears, despite its simplicity, to be of real practical 


use, the following section gives some detail about estimating its param 
eters. 


*I would like to acknowledge my indebtedness to one of the referees for drawing my attention 
to this example. 
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METHODS OF ESTIMATING THE PARAMETERS IN EQUATION (7) 


The parameters are E(x) = £, , a and 6. As our main concern so 
far has been to see whether the model fits our observations at all, we 
have not gone into the question of estimation very carefully and can 
suggest only some crude methods of estimating the parameters. 

Considering the way in which the model is built up, it is clear that 
in some circumstances £, and 6 may be estimated without using the 
observed regression of y, on t. 

Estimate of &, . By definition £, is the expected mean value of 
unexposed unattacked samples so that if we have N such samples with 
strengths x, --+ ty an estimate of £, is simply 


é, = z,/N. (10) 


As it is usually true that N can be made large without much trouble 
or expense this procedure may be recommended. 

Estimate of 8. Sometimes it is possible (e.g. by microscopic exami- 
nation) to discriminate between attacked and unattacked samples 
before they are, as is often the case, tested to destruction. When this 


can be done, a series p, , p2 , --* of observed proportions of attacked 
samples at times ¢, , --- is found. According to (3) 
E(p) = 1 (11) 


Fitting the expression (11) through the observed p, by, for example, 
the maximum likelihood method gives an estimate of 8. 

Estimate of a when 8 and £, are known. A weighted least squares 
procedure appears to be advisable. If the observed values of y, at ¢; 
(> 0) are yi. +--+ Yin, and the weights are w, --- w, , we minimize 


2 
So wo) (a B) ae | (12) 
with respect to a, where y;, = >>; y;;/n, , and the result is 


wily:. — — Bie = 0 (13) 


where £,Y; = E(y,,). 

If ~, and 8 are unknown but estimated as above, those estimates 
may be substituted in (13), which procedure will lead to a consistent 
estimate of a. 

Joint estimation of a and B by weighted least squares. When a separate 
estimate of 8 cannot be obtained, a and 6 may be estimated jointly 
by minimizing S, [see equation (12)] with regard to both a and 8. 
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One difficulty is, however, that a and 8 are interchangeable in the 
expression (7). It may be resolved by making use of the fact that 


= — = 8,1), (14) 


where & = E(x’), which may be derived in the same way as the value 
of E(y,). For, it may be shown that a consistent estimate of a is obtained 
if, of the two possible estimates of a, that one is chosen which gives the 
smaller difference, in some sense between £,4(2@, 8) and the observed 
values dy; /n,. Even a crude rule such as: choose a = or according 
to whether u < v or u > v where 


(15) 
v= yi;/n; — £,8(28, &, t,)] 


can be satisfactory. 

Standard errors of weighted least squares estimates of a and 8. By 
argument similar to that used, for example, by Kendall [Kendall, 
1948, p. 208] to calculate the standard errors of functions of moments, 
one can show that the variances and covariances of least squares 
estimates @ and @ of a and 8 are approximately given by 


cov (@, 8) = (A")B(A™’)’, (16) 


where 


L da 
(@ — B) (ae*** — = £,6@, B, t,), 


and of = var (y;.). 
If w; is chosen equal to 1/o; , the result is simply 


cov (8, 8) = (17) 
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To take account of the variance of the estimate £, of &, , the elements 
of B must be increased by amounts 


(x var (é,), ete. 
Finally, the variance of E(y,) is approximately (for £, known) given by 


PRACTICAL EXAMPLE 


Standard size pieces of Cotton Duck treated with copper naphthenate 
were exposed to attack by a test fungus (T.R.L. 255 Chaetomium 
Globosum). Altogether 30 pieces were exposed and by random choice 
they were tested for tensile strength in groups of six each after 13, 26, 
39, 52, and 65 days of exposure. The tensile strengths of ten unexposed 
pieces were measured. The individual tensile strength values (in 
arbitrary units) are given in Table 1. 


TABLE 1 
TENSILE StRENGTHs OF Cotton Duck 
Period of exposure in arbitrary time units 
Unexposed 1 2 3 4 5 

100.5 76.0 24.2 13.0 1.5 0.0 
88.2 67.3 33.3 3.5 0.0 0.0 
107.5 66.5 28.3 13.0 4.8 0.0 
102.5 58.6 37.3 8.3 2.5 2.5 
96.0 67.3 34.5 5.0 0.5 0.0 
107.8 74.8 22.0 5.0 5.8 2.0 
94.0 
103.5 

98.9 

99.0 


All the pieces examined after 13 days of exposure were attacked 
so that these data were not considered suitable for estimating 6 as 
suggested by equation (11). As estimate of ¢, , the mean of the un- 
exposed pieces was used, ie. £, = 99.79. Estimates of a and @ were 
then found by minimizing S, with respect to a and 8, where £, was 
substituted for ¢, and the weights w, were taken as w; = n,/s; where 
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s; is the observed variance of the strengths at time ¢;.. The result was 
that the two values of @ and § were 1.30 and 1.15. The observations 
and the graph £,4(1.30, 1.15) are plotted in the accompanying figure 
and it is clearly a very good fit. A plot of the squares of the observations 
and the curves £,6(2.60, 1.15) and (1.30, 2.30), showed by inspection 
that we should take @ = 1.30, 8 = 1.15. 


ome sens 


TIME 
FIGURE 1 

TENSILE STRENGTHS oF Corron Duck 


The variances and covariances of & and § calculated as suggested 
above turned out to be 


cov 8) & 


1.323 
—1.003 
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These values are surprisingly large, due, we believe, to the fact that in 
this example @ is very nearly equal to 8. However, due to the high 
negative correlation between @ and , using equation (18) shows that 
E(y.,) may be estimated with reasonable precision. For example at 
t = 1, sd. Bly.) = 2.12. Of course, bringing into the calculation 
the variance of £, increases this standard deviation and the result is 
s.d. B(y,) = 3.18. Att = 5 the two standard deviations are respectively 
.28 and .32. No details are given of the calculation incorporating var 


(é,) because normally it is easy to make N so large that var (£,) may 
be made negligible. 
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OPTIMUM GROUP SIZE IN PROGENY TESTING AND 
FAMILY SELECTION 


ALAN ROBERTSON 
Institute of Animal Genetics, Edinburgh, Scotland 


The research worker concerned with animal breeding problems is 
frequently asked the question ““How many progeny do I need to test 
a sire adequately?” This is an awkward, if not impossible, question 
to answer because it depends on the meaning of the word “adequately.” 
The more progeny the more accurate the assessment—the level of 
accuracy that is considered adequate is a matter of subjective judgment. 
But, by making the content of the question a little more practical and 
precise, it can be put in forms to which a useful solution is possible. 

In any progeny-testing programme (this implying a_ breeding 
programme, relying on progeny-testing for improvement, rather than a 
method of evaluating sires), two factors external to the actual testing 
procedure will define the programme. They are the facilities available 
(limiting the total number of progeny which can be tested in any genera- 
tion) and the number of sires that will be selected each generation. 
The latter will be limited by the rate of inbreeding which can be tolerated 
in the programme. The further variable factor in the programme is 
the number of sires which will be tested each generation. This, in 
relation to the available facilities, determines the group size. The 
problem then is to determine the group size which will lead to the 
maximum expected genetic superiority of the selected sires. This is 
a balance of choice against accuracy. The more sires are tested the 
greater will be the choice, but the less accurate the information on 
which the choice is made, because the number of progeny per sire is 
less. This problem can be solved within the framework of statistical 
genetics. 

Within this framework, the expected genetic superiority of chosen 
sires is then given by 


AG = 


where 7 is the selection intensity (in standard units) corresponding 
to the proportion of sires selected, r;¢ is the correlation between the 
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progeny average J and the breeding value of a sire, and a, is the genetic 
standard deviation in the population. If we can assume a normal 
distribution of the breeding value of sires, then 7 can be obtained from 
the proportion selected from standard tables. If we know the herit- 
ability h? of the character on which selection is based, then if there are 
no non-genetic causes of differences between sire groups, we have 


where 7 is the number of progeny in the group. When we have testing 
facilities for N animals and a knowledge of the number of chosen 
sires, S, required, we can easily calculate the value of n which maximises 
AG. The results are illustrated in Table 1 for N = 100, S = 2, h? = 0.25. 


TABLE 1 


Tue Expected GENETIC SUPERIORITY OF SELECTED SrrEs IN A PROGENY 
SCHEME 


(N = total progeny measured, S = sires selected, n = offspring/sire, 
AG = genetic superiority of selected sires in genetic standard deviations). 
Group size (n) ay 5 10 20 
Accuracy (r7q) 0.46 0.50 0.63 0.76 
N = 100,S =2 
Sires tested 25 20 10 5 
t 1.74 1.64 1.27 0.83 
AG 0.80 0.82 0.80 0.62 
N = 200, S = 4 
Sires tested 50 40 20 10 
z 1.80 1.69 1.33 0.90 
AG 0.83 0.85 0.84 0.68 
N = ~, N/S = 50 
Sires tested 12.5S 10S 5S 2.58 
7 1.86 1.76 1.40 0.97 
AG 0.86 0.88 0.88 0.74 


AG passes through a maximum when 7 is in the region of 5. Suppose 
we now turn to other values of N and S, but keep the ratio of N/S 
constant. The values of 7 and AG are slightly altered in each case, 
because for a given proportion of sires selected, 7 increases a little as 
the number selected from increases. The position of the maximum 
is little changed. It seems then that with a considerable gain in gener- 
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ality, we can treat the value of N as very large and use as our new 
variable N/S, the total number of progeny recorded for each group 
that is eventually selected. This we shall call the testing ratio and 
give it the symbol K. It is convenient here to introduce a further 
variable, p, the proportion of sires selected. As the number of sires 
tested each generation is N/n, it follows that 


P= N/n 


K 
orn = Kp. 

In terms of K and p, the problem turns out to have a general solution. 
To calculate 7, we assume that we are selecting from a large sample, 
so that 


where z is the ordinate of the normal curve at the point where the area 
cut off = p. 


Similarly, 
Inh? 
+ — Ih’ 
=Nn+a 
p+a/K 
Then 


p Vp + 


Thus the expected superiority of the chosen sires is a function of 
p and K/a. For any value of K/a, we are concerned with finding the 
value of p which maximises AG. It follows that the optimum value 
of p will be a function only of K/a, as will the maximum value of AG. 
We have then introduced a much greater generality into the problem. 

We have now to find the maximum value of AG or of (z/p) 
-Vp/(p + a/K). Itis algebraically much simpler to find the minimum 
of the square of the reciprocal of this, [p(p + a/K)]/z’. In this, we 
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must remember that dz/dp = x, the abscissa of the normal curve at 
the point defined by p. 


p(p + | _ a 
dp al (2p ) ol + dp 


1 


at the minimum. 


This simplifies to 


We have thus equated K/a to a function of p, easily calculated from 
tables of the normal distribution. We can thus determine the optimum 
value of p in terms of K/a, and therefore also the maximum value of G. 

These are shown in Figures 1 and 2, in which K/a is plotted on a 
log scale. Figure 1 shows the optimum value of p and Figure 2 the 
maximum value of AG. Several interesting points emerge from a study 
of these. The first is that p increases as K/a declines but approaches 
asymptotically the value of 0.27 (where 2px = z). Thus, for the 
optimum running of any such scheme, the intensity of selection between 
tested sires must be at least one in four. At values of K/a greater than 
3, it is found empirically that the curve approximates closely to 
p = 0.28 Va/K. Substituting n = Kp, this leads to the expression 
n = 0.56 V K/h’ as the optimum value for group size. 


0204 


O10 


4-h 


FIGURE 1 
Tue Optimum Proportion or Sires SELEcTED (p) As A Function or Kh?/(4 — h?) 


As might be expected, AG,,,. increases as K/a increases. Above a 
value of K/a of about 5, it becomes a straight line in the plot against log 
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K/a. This fact, which has thus far defied algebraic check, means that 
a given proportionate size in K/a will produce the same arithmetic 
increase in the superiority of chosen sires. 


FIGURE 2 
Tue Expecrep GENETIC Superiority oF SELECTED Sires (IN TERMS OF o, IN THE 
Population) As A Function or Kh?/(4 — h?) 


Tolerable Limits of Group Size 


We have so far determined the group size which will give maximum 
superiority of chosen sires. It is useful to inquire further into the 
limits of variation in group size which can be tolerated before the 
genetic superiority falls off markedly. This can only be done empirically 
by drawing out curves of AG against n for different values of K and 
of h®. Figures 3 shows the curves for K = 100 for different values of 


os 


° a so 
GROUP SIZE 


FIGURE 3 
Tre Variation oF GENETIC SUPERIORITY OF SELECTED SrrEs witH Group Size 
(K = 100, h? 
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h®. For simplicity, they have been drawn as continuous functions. 
They all must intersect the AG axis at n = 0 and n = K (the latter 
implying p = 1, no selection). This imposes limitations upon the shape 
of the curves and on the optimum value of n. Thus in this case over the 
range of heritabilities from 0.01 to 0.25, AG will be within 10 percent 
of its maximum over the range of n from 10 to 26. For the comparison 
of the algebraic results with the actual curves, the three curves corre- 
spond to the points A, B, and C in Figures 1 and 2. 


20° 


K = 1000 


h=0-25 


° 25 so 
GROUP SIZz 


FIGURE 4 
Tue VaRiATION oF GENETIC SupERIORITY OF SELECTED Sires witu Group Size 
(h? = 0.25, K VartaB.e) 


Figures 4 shows the set of curves for h® = 0.25 and different values 
of K. AG will be zero for all when n = 0, but the separate curves will 
again be zero at different points (when n = K, in fact). 

There is thus not the same restraint in the position of the maximum. 
For h? = 0.25, there is a short range, in the neighborhood of n = 12, 
for which AG is within 10 percent of its maximum value over these 
ranges of K. But for lower values of h’, these tolerable ranges of n no 
longer exist. 


TABLE 2 
Tue Rance or Group Size Givinc GENETIC IMPROVEMENT WirTHIN 10 
PERCENT OF THE Maximum For A Herirasiiity Rance 0.01-0.25. 


Testing ratio K Group size 
50 6-11 
100 10-26 
200 20-40 


1000 60-90 


Fic 
10 
k = 100 
i 
K = 50 
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Rather surprisingly, it seems that tolerable values of n are depending 
much more on the testing ratio K than on h’. In fact, useful ranges of 
nm can be given in terms of K which will give genetic superiority of 
chosen sires within 10 percent of the maximum value over the h’ range 
0.01 — 0.25. Table 2 gives these ranges for certain values of K. The 
preferred values within the range will increase as h* decreases. 


The Effect of Non-Genetic Differences between Groups 


The case which we have so far discussed assumed that the variance 
component between progeny groups was entirely genetic in origin. In 
many practical instances, however, this condition is not satisfied and 
there are environmental effects common to all members of a group. 
This reduces the effectiveness of selection as it introduces a non-genetic 
element into the between-group variance which cannot be reduced by an 
increase in group size. Formally we may describe this as a proportion 
(c’) of the total variance, which is common to all members of a group. 
We have then the expression for ry¢ . 


1+ i(n — + 4c’) 


J he J + 4c’) 
NA? + 4c? N1 + — 1)(h? + 4c’) 


We may then generalise the presentation given above to include 
this case by writing 


in the determination of optimum structure and by multiplying the 
expected improvement by the factor Vh7/h? + 4c’. As a’ will be less 
than a, it follows that for a given K the value of p at the optimum will 
be less than in the simpler case so that the optimum group size will be 
decreased and the number of sires tested increased. The generalizations 
we derived earlier about the minimum intensity of selection of 1 in 4 
for such a programme will therefore still hold. The tolerable limits 
of group size will still hold, except that the critical range they refer to 
will be of h? + 4c’ instead of h’. 

Examination of the expression for r;g shows that it falls con- 
ceptually into two parts, if we introduce the idea of the breeding value 
of a sire under a repeatable but not measurable environmental bias. 
The first term, VW/h?/(h” + 4c’) , is then the correlation between the true 
breeding value and the actual value under that bias, and the second, 
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in(h? + 4c’) 
1+ — + 4c’) ’ 


is the correlation between the biased value and its estimate, which has 
been subject to further error by sampling within the group. 


Discussion 


The treatment so far presented has been in terms of a progeny- 
testing programme. Exactly the same formulae will of course hold in 
half-sib family selection. In full-sib selection, the formula will have 
to be slightly modified. We should then write a = (2 — h’)/h’, rather 
than (4 — h’)/h’ in the half-sib case. We have presumed that the family 
size is variable at will. This of course is not so and the logical limitations 
in this are in some cases critical. Full-sib selection in cattle is obviously 
absurd and in poultry, full-sib family size is limited by the need to 
have only a limited hatching period. These factors, rather than the 
considerations presented here, will then be of overriding importance. 

It may be of interest to illustrate this approach with two practical 
examples. The writer is associated with a progeny-testing scheme in 
dairy cattle at an artificial insemination centre. The essential fact 
of the scheme is the breeding of new bulls for testing from the best 
available proven sires. In each generation cycle, two sires (S = 2) will 
be selected on the basis of a total tested sample of 300 heifers (V = 300) 
so that K = 150. If we assume h’ = 0.25 (a = 15) we find a K/a value 
of 10. Referring to Figures 1 and 2, we find that at the optimum, 
p = 0.10 and G = 1.2¢,. Thus we should test 20 bulls on 15 daughters 
each—the actual intention is to test 12 on 25 daughters, because of the 
cost of buying and keeping too many sires unemployed. As g, is in the 
neighborhood of 80 gallons, the selected sires should show a genetic 
superiority of some 100 gallons, which in their daughters would be seen 
as a production superiority over their herd mates averaging 50 gallons. 

Secondly, let us consider a flock of 500 hens to be improved by 
half-sib family selection, in which we take 5 selected sires as the lower 
limit possible. This gives K = 100 and, if we assume a heritability 
of egg production of 0.05, we have a K/a value of 1.25. At the optimum, 


this gives p = 0.19. We should therefore test 26 sires a year on 19 
daughters each. 


Summary 


A general procedure is presented for the problem of optimum group 
size in progeny-testing and family selection programmes. This involves 
the testing ratio, K, the ratio of the number of individuals which can 
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be measured each generation to the number of sires (or family groups) 
that are to be selected. The optimum structure of the population and 
the probably genetic superiority of chosen groups are then dependent 
only on K, the heritability, h*, and the genetic relationship within 
groups. The treatment has also been extended to cover problems 
arising from non-genetic differences between groups. 


| | 
| 
| 
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PARTITION OF EXPERIMENTAL VECTORS CONNECTED 
WITH MULTINOMIAL DISTRIBUTIONS 


L. C. A. Corsten 
Institute for Research on Varieties of Field Crops (I.V.R.O.) 
Wageningen, The Netherlands 


1. Introduction 


The investigation of variates in contingency tables (which need 
not be orthogonal) often gives rise to a partition of the experimental 
result in components each illuminating one aspect of the problem in 
question. This is here expressed in terms of vectors. Compare Kuiper 
[9]. 

Our presentation is perhaps more transparent than previous papers 
dealing with similar subjects: Fisher [3], Fog [4], Irwin [6], Lancaster 
[10], [12], [13]. Moreover our method is general; hence it can be applied 
also to more intricate cases. Our results can be applied in genetics, 
which in fact motivated our research. This is why we give an intro- 
ductory section (Sec. 2) on genetics. 


2. Genetics 


We consider externally perceptible properties of individuals (plants, 
animals), like the colour (green or yellow) of seeds, or of eyes (brown or 
blue) of men. Each property to be considered is determined by genes 
of only one locus, dominant A and recessive a, such that the only 
possible distinction with respect to that property is dominant (AA or 
Aa) or recessive (aa). 

We assume that the choice of a paternal gamete by a zygote is 
stochastically independent of the choice of a maternal gamete. More- 
over we assume equal viability for every combination of gametes. 

Then it follows that crossing individuals Aa with each other will 
produce zygotes having the dominant and the recessive form of the 
property with probabilities ? and } respectively. 

We consider a second property and locus with genes B and b. If 
this locus is on a chromosome different from that carrying A and a, 
the dominance or recessivity will be independent for the two considered 
properties. 
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Crossing such individuals Aa/Bb with each other yields zygotes 
with probabilities as mentioned in the following table: 


(1) 
B 
16 16 
4 
b 16 16 


where the capitals and lower case denote the dominant and recessive 
phenotypes respectively. The column and row totals represent the 
chances of the dominant and the recessive forms of the first and the 
second property respectively. 

If both loci occur on the same chromosome and crossing-over does 
not take place, the gametes of an individual Aa/Bb are only AB and 
ab with equal probabilities. Crossing such individuals with each other 
produces zygotes for which the following table of probabilities appears: 


_|A 
B| 2 0 (2) 
b| 4 


In a similar way crossing individuals Aa/bB with gametes Ab and aB 
yields: 


_|A a 
B\ (2’) 
bi i 0 

In both (2) and (2’) the column and row totals are the same as in (1) 

and they have the same meaning. 

If both loci occur on the same chromosome and crossing-over takes 
place, the gametes of an individual Aa/Bb (in the coupling phase) are 
AB, ab, aB and Ab with probabilities, say, } — 3W, 3 — 4W, 4W, iW 
respectively (with 0 < W < 4). Crossing such individuals with each 
other yields zygotes to which the following table of probabilities corre- 
sponds: 


A a 


i(—W*+ 2W) (3) 
b| -W?+2W) i(W* — 2W + 


In a similar way individuals Aa/bB (in the repulsion phase) where 
crossing-over takes place will have gametes AB, ab, aB, and Ab with 
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probabilities 3W, 3W, 3 — 3W, and } — 3W respectively. The follow- 
ing table of probabilities for zygotes yielded by such gametes corre- 
sponds to this situation: 
| A a 
B|i(W* +2) — (3’) 
b | — W’) 


In both cases (3) and (3’), the column and row totals are the same as 
in (1), (2), and (2’). If W = 0 (crossing-over does not occur), (3) 
reduces to (2), and (3’) reduces to (2’): complete linkage. If W = 3 
both (8) and (3’) reduce to (1): the two properties are completely in- 
dependent. 

Increasing linkage (i.e. decreasing W) appears with respect to 
tables like (1) as an increase of the numbers in one diagonal and a 
decrease of the numbers in the other diagonal, row and column totals 
remaining constant; the contribution from linkage to each cell of the 
table has the same absolute value. 


3. Definition of components of a two by two table 


We. consider a scheme of probabilities like (1), (a scheme with 
probabilities p, and qg, for rows and p, and q, for columns and inde- 
pendence of these probabilities). This scheme (vector) is represented 
by 


ive 
Qip2 192 


and it is called the basis of the one-dimensional space of levels. 
It is our purpose to compare a scheme of counts arranged in a two 
by two table: 


| where P+qa=1 (i = 1, 2) 


4 
z= | where >> 2; =n, 


i=1 


or rather of the relative frequencies x;/n, with such a level. The ex- 
perimental result may suggest that the independence is satisfied, that 
also the proportion pz : g2 is not a bad description, but that the pro- 
portion p, : q, is wrong. 

Then the following scheme may be more likely: 


+a)p. (p+ 
(q = = @) qo 


: 
7 
| 
| 
| 
| 
| 
1. 
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wherein a is a suitably chosen real number. Also in this scheme the 
probabilities for the columns are p, and g, respectively and there is 
independence between row and column probabilities. The scheme is 


equal to 
hag | P2 
W192 —P2 —Qe 


We call 


a row effect and 


the basis of the one-dimensional space of row effects. 


Analogously, 
ha 
a 


is called the basis of the one-dimensional space of column effects. 
It may also happen that the independence is satisfied, but that 


both proportions p, : g, and pz : gz are wrong, and that at first instance 
the following scheme seems to be more plausible: 


+ a)(~2 +8) + — 
— a)(p2 + B) — — B) 


The chances for the rows are (p, + a) and (g, — a) respectively, 
those for columns (p, + 8) and (q. — 8) respectively, the independence 
being maintained. But if, in accordance with the customary practice 
in the analysis of variance, we postulate additivity of row and column 
effects, the following vector appears: 


+ aps + Bp. + aq. — 
— + BQ: 192 — ag. — BQ 


(4) 


(5) 


Pa 
—P2 
| 
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In this scheme row and column totals are as desired, but the vector 
(4) is equal to (5) plus 


—aB a8 
The remark in Sec. 2—last paragraph—may be expressed as follows: 


Table (3) can be written as the sum of (1) and 


1 
16 


and Table (3’) as the sum of (1) and 


1 
16 


So it is plausible to consider a vector 


-1 1 

as representative of linkage, or of disturbance of independence, or, in 
terms of the analysis of variance, as interaction. Therefore we call the 


vector 
—1 1 


the basis of the one-dimensional space of interactions. We remark that 
in contrast with row and column effects, this basis does not depend on 
the form of the level. 

In (4) and (5) we saw that, if both row and column effects are 
present, and moreover additive, a disturbance of independence with 
respect to the scheme (4) considered as level occurs. The disturbance is 


If a and @ are small, this interaction is very small and even absent if 
a and/or 8 are zero. Then this interaction is negligible in comparison 
with possible interaction from linkage. 


4) 
ib? 
aB —aB 1 —1 
= af 
| 
i 
| 
| 
| 
| 
i 
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With these remarks in mind it will be our problem to determine the 
coefficients in the following partition: 


= 
where yu, a, 8, and y are unknown. 


4. The Partition 


The conventional solution of u, a, 8, and y is not simple, because 
the basis vectors on the righthand side of (6) are in general non-orthog- 
onal. Orthogonality, i.e. ordinary inner products being zero, implies 
that the components can be obtained from orthogonal projections. 
Orthogonality can be obtained with a suitable choice of the following 
equivalent partition: 


Lo 
B= 
Cap. Dad —Cp, —Dq 


A —B A - 
Cau —Da —C D 
The four components are orthogonal, if the 6 inner products, of 
which two turn out to be identical, vanish: 


+ B’pg2 — — = 

A*pip. — B’pig2 + C’qip. — = 

— B’pig2 — + = 
A*p, — + C’p, — = 
A*p, + B’p, — — = 0. 


The third equation minus p, times the fourth equation, and the third 
equation minus p, times the fifth equation, yield C? = (q./p,) D® and 
B? = (q:/p,)D’ respectively. Substitution in the third equation 
yields: A* = (q:92/p:p2)D°. If we choose D’? = 1/q,q2 , we obtain : 

: 
MP2 


(7) 


= 
PiP2’ PiQe2 


(6) 
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and these values happen to satisfy all equations. 
Substituting the values for A, B, C, D in (7) we get: 


Xo 


V UG 


MV QUP2 NV 


Pi +8 Pe V Dip2 V 


or putting: = , Pide = Qi Po = 73, and Gig. = m4, We obtain: 


+a 


xy XL. | 


Vrs | — Va = 
(8) 
+ B V7 Va + Va Va 
V P2492 | V | — 


The four new vectors (in brackets) are perpendicular to each other, 
indeed (7,7, = 273). They are moreover unit vectors. The required 
coefficients are thus simply the inner products of the vector [on the left 
hand side of (8)] to be projected, and the vector on which it is projected. 
For example, uv is obtained by taking the inner products of both sides of 
(8) with the unit vector of which u is the coefficient. In this way we 
obtain: 


ll 


+ t+ +2) = i, 


1 
[(a, + Xo) Qi — (a3 + 
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or 


a= (a, + (2, 
n 


In a similar way: 


B= (x, + Xs) Qo = (22 + Po 


n 


and 


— — + LsPiP2 


n 


Substituting these values in (8), subtracting the first term on the right 
from both sides and multiplying both sides by Vn, we obtain: 


— NT, Le — Ne 
V NT, Vnr, 


— Ge — NM, 


V 


4 + Qo — (4%. + Ls)Po] = 
Va. 
4 (119192 — — + LspiPo) Vas 
V = V7, 
5. Statistical considerations 
x has a multinomial probability distribution: 
n! Zs Zs Tk 
where 
k k 
t=1 
or P(x) = (10) 
n 


(> z;)! 


In Sec. 4 we had the case k = 4 and E(z,;) = nz; . 


% 
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The same probability (10) will be obtained if we formally assert that 
the x; have independent Poisson distributions with \; = nz, and 
probabilities \j‘e~‘/x;! under the condition that their sum is equal 
to > 2; ; for the sum of Poisson distributed variables has also a Poisson 
distribution with parameter >*), (in this case equal to )-nz, = 7), 
and its probability occurs in the denominator of P(x), as is proper in 
the case of a conditional probability. 

The same formal assertion implies that X; = (4; — na;)/V nr; 
has mean zero and unit variance. If further nz; is sufficiently large 
(e.g. > 9), the distribution of X, will be approximated by the normal 
(0, 1) distribution. 

The vector X that can be represented as a point in a k-dimensional 
space K, has thus approximately a probability density 


however, with the restriction 


k k k k 

or + =n, o >, =0. 

In other words X is situated in the (& —1) dimensional subspace of K, 


perpendicular to the vector (Wz, , °°: » 
Thus we find the (known) result that 


— 


has approximately a x’-distribution with (k —1) dimensions (or degrees 
of freedom). 

In Sec. 4 we succeeded in splitting up X (situated in a 3-dimensional 
space) in three perpendicular components each of which has a special 
meaning. From the foregoing it follows that, if H(x;) = nm, , the square 
of every component vector—which is equal to the square of the co- 
efficient belonging to it in (9)—has approximately a x*-distribution 
with one dimension. 

In the light of our definitions in Sec. 3, we can even say that the 
square of the length of the first component has a one-dimensional 
x’-distribution, if only row effect is absent irrespective of whether 
the other effects are present or not. The same is true for the second 
component if only column effect is absent, and the same for the third 
if there are no interaction and not too large simultaneous main effects. 
In other words, each of these components affords us a specific test 
criterion for the null hypothesis that there is no row effect, or no column 
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effect, or no interaction which can be tested independent of the validity 
of the other two hypotheses to a certain extent. The last restriction — 
concerns the simultaneous occurrence of considerable row and column 
effects which influences the component for interaction. The numerical 
value of each of these three statistics is thus obtained by splitting up 
the known test criterion X° for goodness of fit in three terms, the first 
of which is equal to 


+ Lo) qr — (x3 + 
the second equal to 


+ 2X3) Jo + 
NP2Q2 


and the third is the rest: 


NPiNP2Je2 


Just as is the case in tests with X’, the critical region in the one- 
dimensional x*-distributions will be a one-sided upper critical region. 


Example (Fisher [3]). Counting of seedlings of self-fertilized maize, 
which was heterozygous Aa/bB (i.e. in repulsion phase) with respect 
to two properties, viz. starchy versus sugary and green versus white, 
gave the following results: 


| starchy sugary 


green 1997 904 
white 906 32 


If the properties are independent, then the vector level will be equal to 


3 
with p, = p, = }. 
3 1 
16 16 


The ordinary x*-test criterion for these probabilities will be found to 
be equal to 12.21 + 47.18 + 180.19 = 287.69 which is significant, 
e.g. at the 1% level. 

In order to investigate the origin of this discrepancy from our 


we = 


PARTITION OF EXPERIMENTAL VECTORS 461 


expectation (compare case (3’) in Sec. 2), we calculate the squares of 
the coefficients in (9): corresponding with row effect 


[Her + 22) — + _ [ar + — + _ 
3n 


with column effect 


3n #8; 


with interaction 


(x, — 3a. — 323 + 9a)” 
on 


which are the same as those obtained by Fisher. The sum of these 
squares is indeed equal to the value of X*. Further, we see that the 


deviation from our expectation is practically exclusively due to inter- 
action or linkage. 


= 286.27, 


In many cases we have no theoretical indications about the chances 
for rows and columns. Then (and in the case that row and column 
effects appear to be present and a further investigation of dependence 
is wished), the unknown chances are estimated according to the maxi- 
mum likelihood method assuming independence between row and 
column chances. These estimates are proportional to the marginal 
totals and imply that X” has a one-dimensional x*-distribution. 

A more elementary way of approaching this problem and its conse- 
quence for X’ and for the partition of X? runs as follows. Consider the 
set of all possible values of x/n in a 2 X 2 table with the independent 
chances p, and p, unknown and with fixed n. In the subset where the 


marginal totals are fixed, the conditional probability distribution of 
a/n will be: 


which appears to be independent of the unknown p; .. We may there- 
fore assume any p; to be true in the multinomial distribution from 
which the conditional distribution in the chosen subset can be obtained. 
It is thus permissible to assume that in the original multinomial dis- 
tribution the p,; are equal to the chosen marginal totals. We saw that 
the unconditional distribution of X with the assumed p; as parameters 


{ 
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could be approximated by the three-dimensional normal distribution. 
By imposing the conditions of the considered subset to this X, i.e. 
that the marginal totals of z/n should be equal to the assumed row 
and column probabilities, X will be limited to a one-dimensional space. 
In the subset the conditional distribution of such a X?’ will be that of 
a one-dimensional x*. Because this conditional distribution does not 
depend on the marginal totals defining the subset, it is valid in general. 
Further, it follows that by assuming the p; equal to the corresponding 
observed frequencies, the two components for row and column effects 
in the partition (9) of such a X° will vanish, and that by this very 
choice the square of the component for interaction has a one-dimensional 
x’-distribution. 

In this particular case the component for interaction can be reduced 
by substituting the marginal totals for p; and gq; to 


= Lots)” 


(a, + + + + 24) 
Or USING: — XT; = + + + — (2%, + 22) (x, + to 


E _ + + 


which will be useful later on. 


Example: If we take the same numbers as in the previous example 
and we suppose nothing to be known about probabilities, the estimates 
of expected numbers are 


709.30 228.69 


and X°® = 17.64 + 54.70 + 54.55 + 169.17 = 296.06, which is much 
more than the 0.1% point of the one-dimensional x7-distribution 
(10.827). So there is interaction or lack of independence. The fact 
that the latter X* turns out to be larger than the former should not 
surprise us, aS maximum likelihood estimators do not in general lead 
to a minimal value of x’. Therefore it is possible that in this example 
the values p, = p. = ? yield a lower value of the goodness of fit criterion 
X’ than the values of p, and p, , estimated by the maximum likelihood 
method, do. 
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6. 2” tables 


It is not difficult to generalize the treatment of 2 X 2 tables to that 
of 2" tables. We indicate the generalization by presenting the case 
n = 3. In that case the basis vector for the space of levels is defined 
to be a three-way table with given probabilities for the completely 
independent rows, columns, and layers (p; + q; = 1). The probabilities 
for each of the eight cells are then given by Fig. 1. The probability 
for row 1 (back face) is p, , for row 2 (front face) g, , for column 1 (left 
side) p. , for column 2 (right side) g, , for layer 1 (bottom) p, , and 
for layer 2 (upper face) gq; . 

We call a vector row effect if the sum of this vector and the vector 
level looks like Fig. 2. The probability for back space is p, + a, for 
front face (q, — a), for left side p, , for right side g, , for bottom p; , 
for upper face q; , all these being independent. 

We choose as a basis vector for row effects, column effects, and layer 
effects the vectors represented by Figs. 3, 4, and 5 respectively. 

We call a vector row X column interaction, if the sum of this vector 
and the vector level looks like Fig. 6. The probabilities in the six faces 
are equal to those in the vector level by itself, so that there is no main 
effect in this sum. The chances in the vertical edges are: 


6 G2 + 6 

so that independence between rows and columns is disturbed. Further, 
the probabilities for layers are independent of those for the other 
classifications. 

We choose as a basis vector for row X column interaction, row X 
layer interaction, and column X layer interaction the vectors represented 
by Figs. 7, 8, and 9 respectively. 

Finally, we call a vector second-order interaction if the sum of this 
vector and the vector level looks like Fig. 10. 

The probabilities in the six faces are equal to those in the vector 
level by itself, so that main effect is absent in this sum. The probabilities 
in the twelve edges are also equal to those in the vector level by itself, 
so that there is no first-order interaction in the sum. 

But the disturbance of independence between row and column 
probabilities in the bottom is different from that in the upper face. 
Similar remarks can be made about row and layer probabilities and 
about column and layer probabilities. In other words, the relation 


between any pair of classifications cannot be described without in- 
volving the third classification. 


is 
is 
| 
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P, 24 (pe) 4245 


Fig. 2 Fig. 3 Fig. 4 
Level + row effect Basis for row Basis for 
effects column effects 
PQ 
Fig. 5 Fig. 6 Fig. 7 Fig. 8 
Basis for layer Level + row X Basis for row X Basis for row X 
effects column interaction column interaction _layer interaction 
Fig. 9 Fig. 10 Fig. 11 Fig. 12 
Basis for column X Level + second Basis for second Example of a 2% 
layer interaction order interaction order interaction table (Roberts e.a.) 


We choose as a basis vector for second-order interaction the vector 
represented by Fig. 11. 

About the thus defined interactions, remarks similar to that in 
Sec. 3 may be made. If main effects and interactions are present, we 
have in the place of p,p2p; a chance equal to 


(pi + + B)(ps + + + + ep. + B) + +a) +n 
+ apsps + Bpips + + (eB + + (ay + 
+ (By + Dp: + (at + Be + v6 +.06y + 7). 


From this it will be seen that three main effects together introduce 
contributions to all of the four interactions. In that case certain 
first-order interactions may nullify the additional contribution to the 
second-order interaction. If there are two main effects only, there 
will be a contribution to the first-order interaction between them. If 
in this case one or both of the other first-order interactions are non- 


¥ 
Pid 
-U% 
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zero, a contribution to the second-order interaction may take place. 
If there is only one main effect and simultaneously an interaction between 
the two other classifications, there will be a contribution to the second- 
order interaction. If there are no main effects at all, the second-order 
interaction obtains no contributions from possible first-order inter- 
actions. We may conclude that disturbing contributions to inter- 
actions may be caused by main effects, but that they will be negligible 
if a, 8, and y are small. The drawback of such contributions is not 
great as it is not our intention to estimate the different effects, but to 
investigate the origin of discrepancies between expectation and ex- 
perimental result. 

It will be our purpose to split up a vector of experimental numbers 
a,/n (i = 1, 2, «++ , 8; 5; 2; = n) into eight components in the direc- 
tions of the defined basis vectors. To facilitate this partition, we again 
divide the numbers on similar places by the square root of the probability 
of that place in the vector level. Putting those probabilities equal to 
a; , we obtain the following formulation of our problem: 


x, x, 
= 
2 
H 
: ' + 
Fig. 13 
Equation (11): Partition of 2° table 
Calculate the coefficients nu, a, 8, --- 7 in the equation represented 


by Fig. 13 which will be called equation (11). All the ‘‘cubes” on the 
right-hand side of this equation are orthogonal unit vectors. 


(Observe that = = 7 = and 


a 
a 

q 
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The required coefficients are thus again the inner products of the 
vector on the left side to be projected and the vector on which it is 
projected: 


n 

g = ts + — (a + te + 
nN 


y= (x, + + + 24)qs — + % + 27 + 2s)ps 
n 


$= (a, + Xs) Qi 92 — + Le) QiPo — + + (a, + Ls)PiPo 
n 


n 


(x, + — (2 + Ls) — + Lz) + (% + 


n 


? 


1919293 — — + 
n 


92P3 — — LsPiP2Ps | 
n 


Substituting these values in equation (11), Fig. 13, subtracting the firs t 
term on the fight from both sides and multiplying on both sides by ~/n, 
we get on the left a vector X consisting of numbers (4; — nz,)/-V nr; 
the square of which is again equal to the well-known goodness of fit 
criterion. X° has approximately a x’-distribution with 7 degrees of 
freedom, if the null hypothesis that the vector level contains the prob- 
abilities for each cell is true. Further, the seven components of X on 
the right are perpendicular, and X’ is equal to the sum of the seven 
squares of these components. Each of these squares has under the 
null hypothesis a x*-distribution with one degree of freedom. 

Now again every component can serve as a statistic for testing a 
specific hypothesis. With the first three components we test a = 0, 
8 = 0, and y = 0 respectively, with the following three components, 
the hypotheses 6 = 0, « = 0, and ¢ = O, under the condition that a8, 
ay, and By are negligible respectively, and with the last component, the 
hypothesis 7 = 0, under the condition that af + Be + yé + afy is 


| 
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negligible. Each of these hypotheses is tested on the assumption of 
independence of the three classifications. 

The mentioned restrictions do not trouble us if no theoretical chances 
are available. Generalizing the considerations at the end of See. 5, 
we take the p,; equal to marginal totals with the consequence that 
a=6=y7=0. Ifany product af, ay, or By appears to be not negligible 
and a further investigation of interactions is required, it is recommended 
to take the relative p; from marginal totals. 

A more difficult situation will arise if a first-order interaction appears 
to be considerable or if it is expected in advance that any pair of classi- 
fications is not independent. This situation is obviously contrary to 
the assumption of independence of the three classifications and will be 
considered in See. 8. 


Example: (Roberts, Dawson, and Madden [14].) Crossing mice AaBbCc 

with aabbcc gave numbers represented in Fig. 12, (A in back face, 

B in left side, and C in bottom). In the level p; was 3 (¢ = 1, 2, 3). 
The value of x’ for row X layer interaction, e.g. is 


and for second-order interaction is 
NP1 Qi P292P3s n 
The seven values for x’ are: for row effect 1.63 
column effect 0.67 
layer effect 1.03 
row X column interaction 0.13 
row X layer interaction 4.25 
column X layer interaction 0.13 
second-order interaction 2.79 
total 10.63 


The one-dimensional x* has at the 5% level of significance the 
critical value: 3.84; the 7-dimensional x’: 14.07. The total x’ is not 
significant. Concluding that the row X layer interaction is significant, 
would be rash: the probability that at least one of seven one-dimensional 
x’ is larger than 4.25 is about 0.24. So there is only a slight indication 
that linkage between A and C may exist. 

It is not always possible to attach a simple meaning to a second- 
order interaction, but in cases like this it could be caused by the fact 


4 


i 
i 
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that one of the eight gene combinations, as a consequence of diminished 
viability, is much less frequent than is expected on account of main 
effects and first-order interactions only. 

About the calculations we can remark the following: x’ for row X 
column interaction can be determined as the interaction x” in the two 
by two row-column table which can be formed by adding along the four 
vertical edges; the formula for 6 is then the same as that for y in a two . 
by two table. Thus one can calculate this x’ again from the test criterion 
X? for this two by two table and by subtraction of the main effects 
for rows and columns. The second-order interaction x’ is found as 
the difference of the test criterion X°* for the 2° table and the sum of 
those for main effects and first-order interactions. 

If the probabilities p; are not known, we take them such that the 
main effects equal zero. In our example we then obtain the value x’ for 


row X column interaction 0.12 
row X layer interaction 4.34 
column X layer interaction 0.14 
second-order interaction 2.69 

total 7.29 


The 4-dimensional x’ has at the 5% level of signicance a critical 
value: 9.49. The total x’ is not significant. The probability that at 
least one of four one-dimensional x* is larger than 4.34 is about 0.14; 
so there is a slight indication that linkage between A and C may exist. 

An example of the partition of a 2° table in 31 components is given 
by Haldane [5]. 


Xn X tables 


In some particular cases a partition of m X n X --- tables (and 
even of non-orthogonal tables) may have sense. We indicate the case 
of a 2 X 3 table which can be generalized easily. 

An inquiry into the attitude with respect to a political proposal 
may be summarized in a table of experimental counts: 


for against no opinion 


men Ze zs 
women % Zs Ze 


/ 
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In this case an appropriate definition of a vector level will be: 


where pp+q=1, (@=1,2,3). 


The chances for rows (men and women) are p, and q, respectively, 
those for columns 1 and 2 together (politically interested), and for 
column 3 (politically not interested), p. and g, respectively, and those 
of columns 1 and 2, p.p3 and pog, (i.e. ps; and g; under the condition of 
being in column 1 or 2). Moreover, all these probabilities are in- 
dependent. Without further explanation we define as basis vector for 


row effect: 
| P2Ps 
—P2Ps —P293 — Qe 


Column effects will have two basis vectors, one corresponding with 
modifications of p, and g, , independence being maintained: 


Dis (12) 
19% 
and one corresponding with similar modifications of p; and q; : 
O a O 


If two such column effects occur together, viz. 8 times (12) and y times 
(13) and, if they are additive, we obtain: 


By 

P2 % 0 
less than a level with probabilities p, , p, + 8, ps + v/p2, ete. With 
respect to this level there is a deficit of By/p. times the column effect 
(13). As 8 will be small in comparison with p, , By/p. , however, will 


be negligible in comparison with 7, and even will be zero, if 8 or y is 
zero. Similar remarks can be made about simultaneous occurrence of 


This is: 


7 
vat 
{ 
4 | 
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additive row and column effects; there will be disturbances of independ- 
ence which will be negligible in comparison with proper interactions. 
We choose as basis vectors for interactions: 


1 

which represents the disturbance of independence between row prob- 
abilities, p, and g, , and column probabilities, p, and q, , and 


| 1 -1 | 

—1 1 0 

which represents a similar disturbance of independence between row 

probabilities and column probabilities, p, and g;. It will be our purpose 

again to split up a vector of experimental numbers 2;/n (¢ = 1, 2, --- 6; 
x; = n) in six components in the directions of the defined basis 

vectors. We divide again the numbers on similar places by the square 


root of the probability of that place in the vector level. Putting those 
probabilities equal to 7; , we obtain: 


Vas 


2s Le 
nV 


+ a Vr Vrs 

B pi gaps — 
Vet Vand — Van. 

| —V aps 0 

+ V 92Ps V 1929s 


(14) 


+ € —V Ups 
V | 


V 9 


\ 
5 
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The six new vectors (in brackets) on the right are orthogonal unit 
vectors. The coefficients are obtained again by taking inner products: 


I 


«2 
ll 


1 


Il 


n [219193 — — + Xsprps]. 


Substituting these values in equation (14), subtracting the first term 
on the right from both sides and multiplying on both sides by ~/n, we 
get on the left a vector X consisting of numbers (x; — nm;)/-V nm; , the 
square of which is again equal to the goodness of fit criterion. Under 
the null hypothesis that the vector level is true, the square of each 
term on the right has a x’-distribution with one degree of freedom. 
It will be remarked that the x’ for row effect, the first column effect, 
and the first interaction can be obtained as the x’ for the similar com- 
ponents in a 2 X 2 table which is deduced from our table by. amal- 
gamating the columns 1 and 2. The x’ for the second column effect 
and the second interaction are equal to 


+ — + and (219193 — — + 
P3Qs 


respectively. They will be obtained as the x” for column effect and 
interaction in the 2 X 2 table consisting of the first and second column 
[see equation (9)], with the restriction, however, that we do not use the 
experimental total, x, + 2, + 2, + 2; , in the denominator, but the 
expected total np, . 

If there are no theoretical values p; , then we take them such that 
main effects are absent, i.e. from marginal totals (these are also maximum 
likelihood estimations). Then the x’ for the first interaction will be 
found as the ordinary x’ test criterion for the 2 X 2 table obtained by 
amalgamating the first two columns and the x’ for the second inter- 
action as the x’ for interaction in a 2 X 2 table consisting of the first 
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two columns, but with row and column probabilities estimated from 
the marginal totals of the whole 2 X 3 table. In this case np, is of 
course taken equal to z, + 7% +%,+2;. 


Example: The inquiry mentioned in the beginning of this section may 
have the following result: 


for against no opinion 
men 1154 475 243 
women 1083 442 362 


(These numbers are taken from Introduction to the Theory of Statistics 
by A. M. Mood, page 273, where they occur as an example of a 2 X 3 
contingency table.) Theoretical probabilities being absent, we calculate 
the two-dimensional x” for the whole table with the aid of expected 
values obtained from marginal totals: 


1114.04 456.67 301.29 
1122.96 460.33 303.71 


and we find 26.78. The critical value of a two-dimensional x’ is 9.21 
at the 1% level. So we conclude that there is association in our numbers. 
In order to investigate the origin of this association, we calculate the 
x’ for the first interaction, which is found as the ordinary x’ from the 
amalgamated table: 


opinion _no opinion 


men 1629 243 
women 1525 362 


and with expected values also obtained by amalgamation of the calcu- 
lated expected values: 


1570.71 301.29 
1583.29 303.71 


and we find 26.77. By subtraction it follows that x’ for the second 
interaction is equal to 0.01. We conclude therefore that there was a 
difference in interest between the sexes, but that no difference in attitude 
against the proposal could be detected between men and women. 


= 
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In general each one-dimensional x’ in a contingency table, where 
the probabilities for rows and columns are estimated from the data 
and where a similar partition takes place, is connected with one of the 
2 X 2 tables which are obtained by successive amalgamations of the 
data. Such a x’ is calculated as the ordinary x’ for interaction (as 
derived in Sec. 5) in this 2 X 2 table of which the table of expected 
values is obtained by a similar amalgamation of the complete scheme of 
expected values (estimated by the maximum likelihood method). The 
last results where theoretical chances are unknown are the same as 
those of Lancaster [10] and Kimball [8]. In connection with the fore- 
going it may be remarked that the method, suggested by Lancaster 
[11] as an exact one, for calculating x’ in a contingency table where 
cells with small expectations are pooled, does not seem to be correct. 

When pooling of cells takes place it is not correct to say only that 
one or more one-dimensional x’ reduced to zero. If, e.g. in a 3 X 3 table, 
two cells in the same row are pooled, in other words are conceived as 
one cell, these cells do not contribute information about the estimation 
of the probabilities for the columns to which these cells belonged before 
pooling. For the vector level can be described as: 


PiP2Ps | PiPeIsPs | 


Pid2Ps | PideQsPs | 


MPs 193 


with p; + g; = 1 @ = 1, 2, 3, 4). 

The estimation of p, and p, takes place from row totals in the same 
way as in the 3 X 3 table. Similarly, the p, will be estimated from the 
totals of the first column and the total of the second and the third 
column together. But p, will be estimated from the proportion of the 
totals in what has been left of the second and the third column, namely 
in the first and the second row. In the expressions for several of the 
one-dimensional chi-squares that do not vanish by pooling, the 
(estimated) values of p, and q, will in general be different from the 
values obtained by estimation in the complete 3 X 3 table. 

The same conclusion follows from the fact that as a consequence of 
such pooling in the complete partition of a 3 X 3 table according to 
main effects and interactions, the basis of column effect with respect 
to p, will coincide with that for interaction in the 2 X 2 table formed 
by the second and third column on the one hand, and by the sum of 
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the first and second row and the third row on the other hand. Thus 
not only this interaction but also the mentioned main effect should 
vanish in the partition after pooling, in order that X°, which measures 
the discrepancy with respect to a level, as represented here, contain 
three interaction components only. 


8. Further remarks about 2° tables 


Several authors (Kendall [7], Simpson [15]) warn against amalgamat- 
ing 2° tables to 2 X 2 tables, even when second-order interaction 
happens to be absent. They demonstrate the possibility: 

(a) that interactions, which in each of the amalgamated classes 
separately tend in the same direction, seem to be absent after amalga- 
mation, or 

(b) that the amalgamated classes separately do not show dependence 
between the two other properties, but that they do together. In our 
opinion this warning is exaggerated in many cases and danger threatens 
from another direction. In our view, as has been shown in Sec. 6, 
dependence between two classifications in a 2° table will just be tested 
in a 2 X 2 table obtained by amalgamation in the 2° table, irrespective 
of whether second-order interaction is present or not. 

While second-order interaction is absent, the case (a) may be con- 
structed by adding a level, a row X layer interaction, and a column X 
layer interaction. In the bottom of the table this appears as adding 
of a level and suitable row and column effects in a 2 X 2 table so that 
a small interaction occurs in it (see Sec. 3). In a similar way a small 
interaction which will have the same direction appears in the upper 
face. However, in the 2 X 2 table obtained by amalgamation of bottom 
and upper face, interaction will be absent. 

Also, while second-order interaction is absent, the case (b) may be 
constructed in the following way. First we form the sum of a level 
with p; = 3, a row X layer interaction, and a column X layer inter- 
action. Because the interactions in the bottom and in the upper face 
of this sum are identical, we can add a row X column interaction to it 
such that both interactions in the bottom and in the upper face separately 
vanish. The 2 X 2 table obtained by amalgamation of bottom and 
upper face shows row X column interaction of course. These dis- 
turbing interactions, however, will be negligible in comparison with 
proper interactions due to dependence. 

The danger to which we alluded consists in maintaining the hypo- 
thesis of independence in the model of a 2° table and the corresponding 
partition, although a first-order interaction appears to be considerable, 
or independence between one or more classifications can be expected 


| 
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to be impossible in advance. For the discussed x’-test and the partition 
in a 2° table—and with this remark we proceed on what we said in 
Sec. 6—are only justified if the three classifications are independent. 

In a case, e.g. like the example given by Simpson [15] where the 
result of a treatment against a disease is investigated by counting 
dead and alive in males and females, it is not obvious that the prob- 
ability of being treated is the same for males and females. 

If one first-order interaction must be taken in account, we have to 
choose a new model to test other interactions. The new model for 
this case is that fora 2 X 4 table. Let the experimental result multiplied 
by n = 2; be: 


treated not treated 
male female male female 


After the choice of a level similar to that in the foregoing section: 
PiP2QsPs 
Pideds 


the basis vectors for interactions may be: 
| 
—1 100 1 0 


P2Ps P24 q2 

—G —1 

For convenience we put the p; as unknown for the rest of this section 
so that no main effects are present. 


With the mentioned example in mind we prefer to define the vector 
level as: 


and 


119294 919394 


= 
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which is the same as that for a 2° table if p, = p;. A basis for inter- 
actions may be formed by: 


| P22 1 -1 0 

—Ps Ps qs 100 
and (15) 

° 0 1 

00 -1 1 
The first of these three vectors represents dependence of death rate 
from treatment, the second and the third represent interaction of sex 
and death rate within the treated and within the not-treated individuals 
respectively. If one prefers to consider dependence between sex and 
death rate and further dependence of death rate from the treatment 
within the sexes separately, one should only transpose the second and 
third column in each of the four vectors. We proceed in the former 
version. 

The last pair of vectors can be replaced in several ways by two other 
vectors, the one representing a common interaction between sex and 
death rate in both treated and not-treated individuals, the other repre- 
senting a difference in dependence of death rate from sex between 
treated and not-treated individuals, and, moreover, such that a par- 


titioning of X° in independent components corresponds to this choice. 
We prefer: 


| PiP2d2  —PiP242 


| 1 -1 -1 1 

1 1 -1 

The first reason for this preference is that the mentioned difference in 
dependence—which may be called second-order interaction—is repre- 
sented by the same basis vector as in the 2° table. A second reason will 
be given in the treatment of the following case. A third reason will 
appear later on. The component of X° corresponding to the first vector 
of (15) can be calculated as the test criterion in the 2 X 2 table obtained 
by neglecting sex. The computation of the other components will be 
treated in the following. 


It is also possible that two first-order interactions must be taken in 


and 
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account. Let us assume also that an interaction between treatment 
and death rate exists in the discussed 2 X 4 table. We take as a new 
level: 


which is the same as that for a 2 X 4 table if p, = p, and as that fora 
2° table if, moreover, p, = p;. This level corresponds to that for two 
separate and independent 2 X 2 tables. A basis for interactions may 
be formed by the last two vectors of the set given by (15). 

This pair can be replaced again in several ways by another pair of 
vectors which express a common interaction between sex and death 
rate in both 2 X 2 tables, and a difference between such interactions 
respectively, and which admit a partition of X’ in iniapenm com- 
ponents. We choose: 


| 1 -1 -1 
1 1 -1 
The difference in interaction is expressed again by the same vector as 


in the previous cases. The component of X’ corresponding to the first 
vector of this pair is equal to: 


9sP59s) 


According to the end of Sec. 5, this may be reduced to: 


and 


(x, — + — NQiPsPs) 
+ 


As we know that a quantity like 


XL — 

V 
(with n sufficiently large) has a standard normal distribution, we may 
consider x, as a normally distributed variable with expectation np,p2p. 


and variance np,pP292)49, . The component is thus the square of a 
normally (0, 1) distributed combination of two normally (0, 1) dis- 


\ 
| 
‘ 


a 
v 
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tributed variables of the considered kind and such that the numbers 
np, and nq, have some weight in this combination, indeed, but not 
too much. This balanced combination of statistics for testing independ- 
ence in 2 X 2 tables, which is also recommended by van Eeden [2], is 
the second reason for our prefererce mentioned in the previous case, 
which is obtained by equalizing p; to p, and gq; to gq, respectively. It 
may be remarked that the chosen measure of dependence agrees with 
Kendall’s [7] quantities 6 both in the case of a 2 X 4 table and in that 
of two 2 X 2 tables. The common interaction is thus not obtained by 
amalgamating the treated and not-treated classes, but by adding the 
two values of this measure of dependence, and this as a consequence of 
choosing the appropriate model. 

The case where three first-order interactions must be taken in account 
will be considered now. If none of the faces of the original 2° table 
plays a special role in this case, we must have a definition of a level con- 
taining three interactions which is independent of the choice of the 
properties allotted to rows, columns, or layers respectively. After 
Bartlett [1] we choose a vector consisting of 7; (¢ = 1 --- 8) with 


8 

=1 and a = = Y. (16) 

We remark that this relation was also valid for the levels of the 2° 
table, the 2 X 4 table, and the two 2 X 2 tables. Conceiving e.g. 
2 = AS A Measure of interaction in the 2 X 2 


table 
"| 
T3 

as is recommended for measuring linkage, we see that the relation (16) 
involves equality of interactions in every pair of faces of the original 
2° table. The difference between the experimental vector and the 
maximum likelihood estimate of the level—which cannot be obtained 
in that simple way (i.e. from marginal totals) as in the cases discussed 
till now—will be called second-order interaction again. The corre- 
sponding X” has a one-dimensional x’-distribution because the estimation 
of the level implies the estimation of six parameters. 

The estimates #, are obtained by determination of the maximum 
of the likelihood function: 


8 
C+ 2; under the conditions (16). 


t=1 


a 


PARTITION OF EXPERIMENTAL VECTORS 479 


Differentiation of the function: 


8 


t=1 t=1 


with respect to 7; gives the equations: 


1, 4, 6, 7) 


and 


Py _ = 0, (i = 2,3, 5, 8). 
yields: = n. Putting \y = 6, we obtain: 
6 = nf, = nt, 
%3 — 6= Nf; , t%+é6=nt, 
nt, — 6 = Nits. 


From this it will be seen that second-order interaction is a multiple 


of the vector 
| | 
1 1 


here again, namely with coefficient 6/n. By substitution in (16) the 
following equation for 6 is obtained: 


(x1 + + + + 8) = — — — — 4). 


Lancaster [12] showed that the test criterion X? = (8°/n) }-8_, 1/4; 
calculated after solution of this cubic equation is asymptotically equal to 


1 (17) 


2 


Now the component of X° aimed at testing second-order interaction 
in the discussed models with no, one, and two interactions is equal to 
the same expression (17) for any n with the restriction only that the 
7; may stand for estimates of the true z; occurring in the level of the 
relative model which, however, converge to the true z, for large n. 


1 
|| 
a 
* 
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We see that the test criterion for second-order interaction belonging to 
any of the four discussed models—on condition that the underlying 
hypothesis expressed by the level is true—is asymptotically the same 
as that belonging to the following level if we observe the order of our 
treatment. This order implied that every model represented a stronger 
assumption about the z; than the following models. - 

In other words, we may conclude that Bartlett’s test of second-order 
interaction (admitting three first-order interactions) is asymptotically 
independent of whether any of these interactions is present or not; that 
our test of second-order interaction admitting two first-order inter- 
actions is asymptotically independent of whether any of these two 
interactions is present or not, but is not valid if three interactions occur 
in fact; that our test of second-order interaction admitting one first- 
order interaction is asymptotically independent of whether this inter- 
action is present or not, but is not valid if one or two of the other true 
interactions is not zero; that the test of second-order interaction as- 
suming no interactions (i.e. Lancaster’s procedure) is only justified if 
no true interaction occurs in fact; that the value of any of the relative 
statistics is asymptotically equal to those admitting more interactions 
if only the null hypothesis belonging to the first statistic is not too 
narrow in the sense that interactions are supposed zero although they 
are present. This result, an extension of Lancaster’s [12] remark, was 
the third reason for our preference in the choice of specific basis vectors. 

Finally, a remark proceeding from a consideration of Bartlett’s [1] 
numerical example also discussed by Lancaster [12]. This example 
showed a three-way classification of numbers of root-stocks according 
to time of planting (at once and in spring), to length of cutting (long 
and short), and to success (alive and dead): 


at once in spring 
long short long short 
alive 156 107 84 31 
dead 84 133 156 209 


Partition of the four-dimensional x’ corresponding to a 2° table yielded 
95.58 for interaction between time of planting and success, 45.40 for 
interaction between length of cutting and success, 0.00 for interaction 
between length of cutting and time of planting, and 0.07 for second- 
order interaction. In connection with these large interactions, Bartlett’s 
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and Lancaster’s criteria are not equivalent and they will not be expected 
to be equal. If we formally follow the procedure discussed in this section, 
a new model, assuming two first-order interactions, would be needed 


for a further investigation of second-order interaction. We would 
consider these two 2 X 2 tables: 
alive dead 
long short long short 
at once 156 197 84 133 
in spring 84 31 156 209 


The two-dimensional x’, 7.41 (the sum of 6.50 and 0.91) could be par- 
titioned in 5.26 for interaction between time of planting and length of 
cutting, and 2.15 for second-order interaction. Bartlett’s criterion was 
equal to 2.27, so that the two criteria do not differ much now. The 
difference could be ascribed to a (formal) interaction between time of 
planting and length of cutting. 

But in our opinion the whole procedure (also Bartlett’s) seems to be 
wrong in this special example. For equality or unequality of depend- 
ence in the two considered 2 X 2 tables has no practical sense and will 
not be an object of investigation in this case. Moreover, the fact that 
the number of root-stocks is equal for all treatment combinations— 
which led up to an interaction x’ exactly equal to zero—suggests that 
these numbers were not random but fixed before the execution of the 
experiment. An interaction between time of planting and length of 
cutting must thus be excluded from the model. For that reason we 
have already referred to this interaction with the term formal. 

In this and in similar cases we have to consider four independent 
binomial distributions defined by four chances z, , in this example, 
chances of alive according to: 


at once in spring 
long short long short 
alive ™ ™2 
dead 1 1 — We 1 1 
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Here interactions are to be defined again. In the particular case where 
the numbers in every column are equal (as in this example), say n, the 
experimental result may be partitioned as follows: 


This partition corresponds to a partition of the three-dimensional 
test criterion for independence in a particular 2 X 4 table in three 
independent components. The second vector on the right represents 
interaction between time of planting and success, and the third vector, 
interaction between length of cutting and success. The sum of the 
first three vectors on the right is proportional to: 


i.e. a vector where 7, — = 1; — 71 — 7, = — 7m, and similar 
relations between 1 — 7; are valid. Such a vector where the (positive 
or! negative) raising of the chance of alive or of dead by long 
cutting is the same for both times of planting, and where this raising 
by planting at once is the same for both lengths of cutting, seems to be 
a natural definition of the hypothesis no second-order interaction for 
this case. The fourth vector will represent second-order interaction, 
i.e. inequality of the mentioned raisings. The corresponding partition of 
X”’ is for this example numerically equivalent to that at the beginning 
of our remark about it. If the numbers in the columns are not equal, 
a first-order interaction, e.g. between time of planting and success, 
can be tested in the 2 X 2 table obtained by neglecting the other classi- 
fication (length of cutting). 

A test for second-order interaction will imply then (and also when 
the partition described shows considerable first-order interactions as 
in this example) a maximum likelihood estimation of the 7; under the 
hypothesis of “‘no second-order interaction,” i.e. 7; + 7, = m2. + 73. 
To that end we determine the maximum of the likelihood function: 


C+ % log + 


i=1 


4 
>> (a; — log (1 — under that condition. 
t=1 


° i 
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Differentiating the function: 
4 4 
> 2; log 7; + (n; — log (1 — + AG, — — 13 + 
t=1 i= 


with respect to z; yields four quadratic equations in 7, , 72 , 73 , and 
a, respectively. The usable solutions (0 < 2; < 1) are: 


_A=m+ =m)? + 


A 


_A+tm— VA+n)* — 
= 


_A+ns — VA +13)" — 425 
2X 


m+ VA — + 


A 


Substitution in the relation between the 7; yields the following equation 
for 


V(n. — + + +d)? — 
+ V(nz3 +d)? — + — + = Don, 


i=l 


from which a solution different from the trivial solution \ = 0 is required, 
unless an approximate solution which may be given by 

+ Ne Ns M4 


a(n; — 
i=1 n; 


and which can be improved by usual methods, is exactly zero. In 
that case the solution of \ is zero. 

In Bartlett’s example this approximate solution of \ was equal to 
4.91. This could be improved to 4.92078. Solving the #,; with the help 
of this value gave the following table of expected numbers: 


105.78 82.89 
82.89 134.22 157.11 208.44 


The one-dimensional x” for second-order interaction according to our 
definition of no second-order interaction appea’ed to be equal to 0.082. 


i 
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1. Introduction 


The validity of the hemacytometer method for estimating the 
density of erythrocytes depends upon several factors. Among these, 
it is necessary to assume that the aliquot of blood taken from the 
organism is truly representative of the total blood mass. In addition, 
it must be assumed that the erythrocytes in the aliquot become 
dispersed completely at random over the surface of the counting 
chamber. The manner in which the erythrocytes randomly distribute 
themselves into the various square divisions of the chamber grid becomes 
of interest by virtue of the fact that knowledge of this distribution can 
yield information concerning the precision of the estimated number of 
cells. Abbé [1878] seems to be the first to suggest that there is a relation 
between precision and the magnitude of the count itself. He proposed 
using as a measure of precision the square root of the mean number of 
cells per division as a consequence of the distribution law which he 
derived. This distribution law was in fact identical to the well-known 
Poisson distribution [1837]. Student [1907] independently derived 
Poisson’s distribution in regard to counting yeast cells. 

It is well known that the mean number of cells per division is a 
good estimate, in fact the maximum likelihood estimate, of the variance 
of a Poisson distribution; so that the standard deviation might simply 
be taken as the square root of the mean as Abbe proposed. However, 
in more recent years, Berkson [1938] has calculated the mean and 
variance of a number of cases of actual red blood cell counts made 
under very excellent counting conditions and has shown that the variance 
is almost invariably smaller than the mean. Berkson tested the equality 
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of the mean and variance, implied by the Poisson distribution, on this 
data by the use of Fisher’s [1922] chi-square test. 

Lancaster [1952] investigated the use of this chi-square criterion 
for controlling counting and found it quite adequate to the task. 
Berkson and his associates discuss the adequacy of the Poisson model 
in a number of papers [1935, 1936, 1938, 1939, and 1940]. Whitaker 
[1914] and Student himself [1919] both give illustrations of the inade- 
quacy of the Poisson in some particular instances and show that in 
many of the common applications of the distribution the underlying 
assumptions are not even approximately met. 

Lancaster [1950] attacked the problem of crowding of the erythro- 
cytes which had been noted by Berkson, et al. [1935]. He derived the 
consequences of crowding upon the relationship of the variance to the 
mean but did not offer a complete distribution law to substitute for the 
Poisson distribution. 

Plum [1936] should be consulted for the early history of the subject 
and Lancaster for later developments [1950]. 


2. The Poisson-Binomial M odel 


It is a well known fact that the Poisson distribution can be con- 
sidered to be a limiting form of the binomial distribution. This latter 
distribution can be derived for the hemacytometer situation in the 
following manner. 

Let us suppose that there are a total of r blood cells which finally 
come to rest under n squares of the hemacytometer grid. We would 
like to know in what manner these r cells will distribute themselves into 
the squares of the grid. In other words, what is the probability that 
there will be no cells in a given square; what is the probability that 
there will be one cell in the given square; or two cells in the given square; 
or three cells; or in general & cells in the given square? Now if we 
assume that all of the r cells are distinguishable and that all n” possible 
arrangements are equally likely to occur, then the probability of obtain- 
ing exactly k cells in a given square can be found. 

It is seen that this probability P(k) is simply the 


No. of arrangements providing k cells in the given square 


Total no. of possible arrangements 


Now, there are (") ways of selecting k cells out of r cells and for each 


k 
of these ways there is (n — 1) 


r—k 


ways of arranging the remaining 


Sng: 
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r — k cells into the remaining n — 1 squares. Thus, 


n 
This is the general term of the series for the binomial [1/n + (n — 1)/n]’. 
In the hemacytometer situation n = 400 if we confine our attention 
to the smallest subdivisions in one large square of the standard hemacy- 
tometer. Each of these smallest squares has an area of 1/400th of a 
square mm. The total number of cells under the grid, r, is usually of 
the order of one to three thousand. With values of n and r in this range 
it can be easily demonstrated that the general term of the Poisson 


distribution 
1 k 
ew {-2(2) @) 


can be used as a quite good approximation for P(k). Now it is well 
known that the variance of the binomial distribution is rpq which in 
the notation of this paper becomes 


but this is, to a close approximation when n = 400 just r/n, which is, 
in fact, the mean. It is also true, as previously mentioned, that the 
variance of the Poisson distribution is equal to the mean. Thus, it 
can be seen that if this binomial-Poisson model were appropriate we 
should expect the estimated mean and the estimated variance to be of 
equal magnitude on the average. This is not the case, as Berkson 
showed. The difficulty can be traced to the assumptions utilized in 
deriving the model. We assumed that all »’ possible arrangements are 
equally likely. This implies that there is no crowding; that is, if a cell 
occupies a square, that it is just as likely that another cell would come 
to rest in that square as come to rest in any other square, one, say, 
that has not been previously occupied. Now, this is a. reasonable 
assumption so long as a square is not nearly filled with cells. However, 
with the range of r usually encountered in red cell hemacytometry 
and erythrocytes being as big as they are, it happens that a number of 
squares get quickly filled and this results in a more even dispersion of 
cells throughout the counting chamber. This, obviously, results in a 
lessened observed variance than would be predicted by the binomial- 
Poisson model. 
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3. An Improved Model 


We will discuss in this paper a model which takes into account the 
above-mentioned crowding due to the finite limitations of space in the 
counting chamber.* A derivation can be made as follows. 

Let us suppose that N is the maximum number of cells that can be 
accommodated under the entire grid, assuming no over-lapping of cells. 
We denote the maximum number of cells which can be accommodated 
under one square of the grid as a. Obviously a = N/n since each of 
the n squares has equal area. We desire the probability, P(k, , k. , 
+--+ , k,) of obtaining exactly k, , k. , --- , k, cells in the n squares of 
the grid. 

Now, the total number of ways in which the r cells can be arranged 


in the N available spaces is (*), since each space can contain only one 


cell. We assume that each of these ways has an equal chance of occurr- 
ing. The number of ways that k, cells can be arranged in the a spaces 
available in the first square, k, cells in the second, and so on is 


Then, the desired probability, of obtaining exactly k, , k, , --- , kn 
cells in the n squares is given by 


where the sum of the k’s is equal to r. 

It is tacitly implied by this derivation that all of the r cells are of 
the same size. Of course, erythrocytes, like all biological products, 
vary somewhat in size; however, it seems likely that the model (4) 
will still serve as a reasonably good approximate model. The adequacy 
of the approximation, as in all scientific models, must be determined 
by the ability of the model to describe approximately all relevant 
details of the observational record. 


*We are obligated to Dr. E. J. Williams for suggesting the model discussed here. The authors 
had employed the hypergeometric distribution given by 


in an earlier draft of this paper. This latter was not so general in that the correlation between squares, 
which had been deemed negligible, was not taken into account. Formula (5) below was also provided 
by Dr. Williams. 
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It can be shown that the expected value of the estimated variance 
of k; (@ = 1, 2, --- , n) is given by 


2 N n 2 


where u is the expected mean, r/n. When N is large, the coefficient 
of » in the first term of formula (5) becomes close to unity so that the 
variance can be approximately expressed by 


(6) 


This is to be compared with the corresponding expression for the Poisson 
distribution, 


(7) 


Formula (5) should be compared with the quadratic relationship derived 
by Lancaster (1950) by a non-distributional argument. In his approach 
to the crowding problem Lancaster was working with larger subdivisions 
of the hemacytometer grid than we have employed and his derivation 
was geometrical in nature. It is interesting that these two independent 
developments have produced theoretical relationships between the 
mean and variance of the same algebraic form. 


4. Test of the Model 


Formula (6) provides a convenient way of testing the adequacy of 
the suggested model. We do not know the value of N but the constant 
n/(N —1) can be estimated from actual data by fitting the quadratic 
equation (6). Figure 1 shows the data of Berkson [1935] with the best 
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fitting (in the sense of least squares) quadratic curve drawn through 
the points. The straight line drawn at 45° from the axes represents 
the relationship between the mean and variance expected on the basis 
of the Poisson model. The description of the data seems to be quite 
good. 

A similar series of observations was made at Duke University 
under conditions rivaling those of Berkson in accuracy of counting. 
These data are presented in Figure 2, along with the Poisson line and 
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RELATIONSHIP BETWEEN THE VARIANCE AND 
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the quadratic relationship for the more general model. Again, the 
description appears to be not at all bad. 

Both the series of Berkson and the Duke series represent obser- 
vations on samples of blood from different individuals. In Figure 3 
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is presented a series of observations made at Duke collected in a different 
manner. Each mean and variance is based on counts of erythrocytes 
for the same blood sample. The mean number of cells per square has 
been varied by using various degrees of dilution of the single sample 
of blood. The dilutions used were 1:250, 1:225, 1:200, 1:175, and 1:150. 
Two preparations were made at each of these dilutions. Each of the 
ten resulting means and variances is based upon 208 of the small squares 
instead of 400 as in the two previously described series. The description 
by the quadratic curve would be quite good if it were not for the two 
high values. Perhaps these two values represent just random variation, 
but we are inclined to think that their departure from expectation may 
be due to disturbances in the chamber filling process. As we intend to 
show elsewhere, the variance of the hemacytometer count can be 
increased to a considerable extent by interruption in the filling of the 
counting chamber so that a homogeneous distribution of cells in the 
chamber does not result. 


5. Size of the Erythrocytes 


We have attempted to test the tenability of the new model by 
comparing observation with expectation as regards one particular 
consequence of this model, namely, the expected quadratic relationship 
between the variance and the mean. We propose the conclusion that 
the results are not, on the whole, at all inconsistent with the model. 
It certainly appears that the new model provides a better description 
of the presented data than the Poisson model. There is another conse- 
quence of the suggested model which can easily be put to a test. We 
have mentioned the fact that it is the effect of crowding which diminishes 
the variability of the number of cells per square. Now, crowding can 
be produced in two different ways. We can either increase the number 
of cells which compete for the available spaces or we can increase the 
size of the cells. The data previously discussed demonstrate the effect 
of crowding by increasing the number of cells. We now consider the 
alternative situation, the effect of cell size. 

First, we note that it is possible to estimate N directly by use of 
the following relationship which is derived from formula (5) by re- 
arranging terms: 


2 2 
= (8) 
where we substitute the observed mean and variance for o”? and uw. This 


is, of course, the “moment” estimate and is, perhaps, not the most 
efficient estimate of N. However, since the maximum likelihood 
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estimator is considerably more difficult to apply, we will use this simple 
estimator to illustrate our point. Now, N is by definition the total 
number of available spaces, so that it is related inversely to the average 
cell size. If we know that we have a sample of blood containing large 
cells we should expect our estimate of N to be smaller than that for 
normal sized cells. The reverse, of course, should hold true if we have 
a sample containing smaller than average cells. Failure of the results 
to comply to this expectation would throw some suspicion on the 
adequacy of the model. 

A sample of blood was obtained from a patient with microcytic 
anemia and from one with macrocytic anemia. The number of cells 
per square was counted for each of the 400 small squares and the mean 
and variance were calculated. These statistics were obtained for both 
patients. N, the number of available spaces, was estimated by formula 
(8) and the results are given in Table 1. The average cell diameter 


TABLE 1 
Patient No. of Mean No. Estimated Mean 
squares ofcells Variance N cell diameter corpuscular 
counted (in microns) volume 
Microcytic 400 5.548 5.045 25,100 6.3 65 
Macrocytic 400 3.420 3.242 14,900 8.2 104 


can be approximated from N (and hence from the mean and variance) 
by assuming that the cells each occupy a square with sides equal to 
the diameter of the cell. Thus, the cell diameter in microns is approxi- 
materly given by 


Cell diameter ~ 1000/-V/N. (9) 


The mean corpuscular volume has been computed from the hematocrit 
and cell count for comparative purposes. We are not proposing calcu- 
lation of the cell diameter from the mean and variance as a practical 
procedure; but it appears to us to yield an interesting confirmation of 
the essential validity of the model. The actual average cell diameters 
are not known but the values computed above are of a reasonable 
magnitude for patients of these two types. 


6. Discussion 


The combined effect on the variance of crowding by increasing the 
size of the erythrocytes, which is equivalent to decreasing N, and of 
increasing the density of erythrocytes in the counting chamber, may be 
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FIGURE 4 
RELATIONSHIP BETWEEN THE VARIANCE AND MEAN 
CELL COUNT FOR A VARIETY OF HYPOTHETICAL CASES 
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seen from Figure 4. The range of values, both as to numbers and size, 
is considerably larger than is encountered in actual practice. However, 
this extension of the range makes possible a portrayal of the full impli- 
cations of the suggested model. For example, it is apparent that as 
the density of cells is increased and/or the size of the cells is increased, 
the variance of the count increases to a point, and then with further 
increase of density or size, the variance declines and eventually becomes 
zero when all of the chamber squares are filled to a maximum and hence 
all contain the same number of erythrocytes. The line corresponding 
to N = o is the degenerate case, the Poisson distribution. When N 
is infinite, then the cells must occupy no spaces at all, and hence no 
crowding occurs. Thus, by considering the Poisson as a limiting case 
of the suggested distribution, we realize why this simple distribution 
does not suffice to describe the distribution of such relatively large 
objects as erythrocytes when in considerable density. 

The reader may wonder why we have chosen to test the theory by 
such indirect measures as those we have described. Why not simply 
test the goodness of fit of the theoretical distribution with the observed 
distribution? This was actually done, but it turned out to be a quite 
insensitive mode of discrimination. It happens that several different 
distributions, including both the Poisson and the suggested distribu- 
tion, yield fits to the data which are all quite good by the chi-square 
goodness of fit criterion. However, it is not the overall fit, but rather 
the precision of the count with which we are primarily concerned. It 
is this specific aspect of the description wherein the new model appears 
to be an improvement over the Poisson. 

We will terminate this discussion by a few comments as to the 
practical implications of the above theory. We were motivated in this 
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investigation by the desirability of being able, in actual practice, to 
place confidence limits on cell counts without having actually to bother 
with making a separate estimate of the variance. By assuming that the 
cells were distributed in the counting chamber according to the Poisson 
Law, we were able to do just that. In this case the variance is taken to 
be the count itself. Now, if instead we assume that the law (4) holds, 
then it is not possible to know the variance from the count alone. In 
addition, we must know the size of the erythrocytes; however, this 
information is in general not known. Fortunately, there is a very 
simple solution to this dilemma, one that is apparent from the graphs 
of the relationship between the variance and mean. If thé cell sus- 
pension is sufficiently diluted, then the limiting Poisson distribution 
becomes an adequate approximation to the true distribution and the 
variance of the count can be taken to be the count itself. 

Another solution might be considered, but turns out to be of no 
value for our purposes. This solution would be to increase the density 
of cells to the point that the variance approaches zero. Certainly, we 
should desire the variance to be as small as possible. However, the 
effect of this procedure is to destroy the representativeness of the 
sample in that there would be a piling-up of cells and thus our necessary 
condition of randomness would be lost. 


7. Summary 


Thus, we finally arrive back where we started via a rather devious 
route. We began with the empirical evidence against the use of the 
Poisson distribution. Then we derived a new model with somewhat 
less unrealistic assumptions about the physical system we were studying. 
We saw that this latter model fairly well represented a variety of actual 
data. Then we considered from an intuitive point of view the known 
fact that the Poisson is a limiting form of this latter distribution. We 
then concluded that the Poisson can be made to apply to red cell 
hemacytometry by making greater dilutions of the cells than is at present 
generally done. This conclusion is justified in two different ways. 
First, with sufficient dilution we can use the Poisson distribution; and, 
second, it seems that there is less chance of disturbing the randomness 
of the sample by eliminating a tendency to pile-up. 
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THE COMPARISON OF THE SENSITIVITIES OF SIMILAR 
EXPERIMENTS: APPLICATIONS* 


R. A. BRADLEY AND D. E. W. ScouMaAnn** 


Virginia Agricultural Experiment Station of the Virginia Polytechnic Institute, 
Blacksburg, Virginia, U. S. A. 


INTRODUCTION 


The problem of deciding on the best method of measuring the 
effects of experimental treatments is a difficult one in certain areas 
of research. We are concerned with means of comparing the sensi- 
tivities of two available experimental techniques, where we use the 
term in a very general sense, that might be used to yield data for 
specified treatment comparisons. It is often not possible to determine 
on a priori grounds which of two techniques should be used and it may 
be desirable to use both techniques in separate experiments with a 
view to deciding on the technique to be used in future experimentation. 
When the two experiments are indeed independent of each other in 
probability, the methods of this paper may be used. 

Let us consider several typical situations indicated by subject- 
matter headings. 

Taste Testing: Hopkins [1950, 1953] investigated the use of discrete 
scoring scales in taste testing experiments as they are commonly used 
and Baten [1946] suggested the use of a continuous line scoring system. 
The latter obtained results which he interpreted as showing an increase 
in sensitivity for his method over the use of discrete scores. Procedures 
discussed in this paper provide means of comparing the sensitivities 
of the two scoring systems if data were available from two identically 
designed but statistically independent experiments, one for each 
scoring system, in a form suitable for analysis of variance. 

Many other comparable situations arise in taste testing. The 
sensitivities of two taste panels, two methods of preparation of test 
items, two environmental situations during tasting, or two methods of 
storage may be compared. 


*Research sponsored by the Agricultural Research Service, U.S.D.A. under a Research and 
Marketing Act Contract, No. 12-14-100-126(20). 
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Chemistry: Consider the analytical chemist with two series of re- 
actions based on the use of different reagents available for a quantitative 
analysis of a particular element in a compound. His ultimate objective 
is to run comparative experiments on the concentrations of this element 
in samples of the compound obtained from a number of suppliers. Two 
similar experiments might be conducted, one for each analytical method, 
to determine which method is to be used in the required routine experi- 
ments. Other problems of this nature arise frequently in chemistry 
and all of the physical sciences. 

Life Testing: Means of conducting accelerated life tests are being 
devised in electronics, food processing, nuclear physics, and in a host 
of other research endeavors. In some cases it may be possible to 
compare experiments based on accelerated tests with experiments 
depending on normal life tests. In other cases alternative means of 
conducting accelerated tests may be available and comparisons of their 
sensitivities in the sense of emphasizing real treatment differences are 
required. 

Agronomy: One of the difficult areas of agronomic research is in 
pasture experimentation. Yields of various forage mixtures may be 
measured through clipping of plots in some experiments while in others 
animal performance or carrying capacity may be taken as the measure 
of yield. While care should be taken to determine that these yield 
measurements may be transformed to comparable units, the need for 
comparison of the sensitivities of techniques relative to the exhibition 
of differences between measurable forage characteristics is again apparent. 

Cochran [1943] discussed the problem of comparing different scales 
of measurement for experimental results. He assumed that analysis 
of variance techniques were applicable and confined his attention to the 
case in which all scales measure the same experiment. It was noted 
that a comparison of the sensitivities of two scales should depend both 
on the experimental errors associated with them and on the magnitudes 
of the treatment effects in the scales. Cochran suggested that a com- 
parison of this sort should depend on a test of significance of the hypoth- 
esis that the parameters of two noncentral variance-ratio distributions 
areequal. Schumann and Bradley [1957] have provided means of making 
such tests of significance for comparisons of the sensitivities of two 
identically designed independent experiments. The main results are 
then for identical experiments in the sense that both experiments yield 
F-tests with the same degrees of freedom and with treatment means 
based on the same number of replications although generalizations are 
indicated. We shall summarize that work in the next section and devote 
the rest of this paper to numerical illustrations. 
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We may have somewhat generalized Cochran’s concept of comparing 
scales of measurement in discussing comparisons of experimental 
techniques. Now some caution is in order in interpreting the results 
on comparisons of experiments. The test procedures developed are 
correct whenever two independent variance ratios are compared. How- 
ever, the experimental techniques under comparison should be such 
that they do not interact with the treatments under comparison in a 
way not explainable in terms of a shift in treatment locations and a 
magnification of differences in treatment effects. It would be a good 
rule to forego experiment sensitivity comparisons if there is evidence 
of treatment by experiment interaction of a nature other than that 
permitted in view of our preceding statement. 

Many experiments are conducted with a view to examining inter- 
actions. To illustrate, suppose that two independent experiments in 
agriculture differ only in geographical location. Then it is more likely 
that the experimenter will be interested in treatment by location 
interactions than in location sensitivity differences to treatment effects. 
The important applications of our methods seem to be in comparing 
experiments based on experimental techniques differing essentially in 
means of measurement which do not interact fundamentally with the 
treatments under comparison. It is of course obvious, if two experi- 
mental techniques are to be compared, that the comparison should 
not be confounded with other experimental changes. 

The method for the comparison of the sensitivities of experiments 
may also be used to compare two population multiple correlation 
coefficients whose sampling distributions are based on regression 
models. An example of this method will be given. 


DISTRIBUTION THEORY 


The following summary of the distribution theory for the comparison 
of the sensitivities of two experiments is based on the work of Schumann 
{1956] and Schumann and Bradley [1957]. 

Let F, and F, be independent noncentral variance ratios, each 
with 2a and 2b degrees of freedom and with parameters of noncentrality, 
d, and A,. Then their distributions are of the form 


+ b, a, ad,F,/b(1 + < Fy < = 1,2, 


(1) 


where B represents the beta function and ,F, , the confluent hyper- 
geometric series. In Model I of the analysis of variance with so-called 
“fixed” parameters in the additive model, if 7,; is the effect of the j-th 


| 
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of ¢ treatments in experiment 7 and if o7 is the 7-th population experi- 
mental error variance, 


t 
4 =k t+=1,2, (2) 
j=1 
when k is the number of observations in each treatment mean. These 
parameters of noncentrality are then measures dependent upon both 
the experimental error associated with the experiment and the magni- 
tudes of treatment effects in the experiment. When ; = 0, f(F;) in 
(1) becomes the density function of a central variance ratio with 2@ 
and 2b degrees of freedom. The specification of degrees of freedom 
as 2a (for the numerator of F;) and 2b (for the denominator of F;) 
was for mathematical convenience; a and b may be integers or half- 
integers. 
We required the distribution of 


(3) 


and its density function is 


g(w; a, b, ) = 


r=0 rls! 


[Ba + r, Baa +s, 


when 
=X 


and where 


H(w;r,s) = [ wy) eg) dy, 
O<w<o, 
= 
2a + 2b+r+s,1— 0<w<l, 
= w +r + s, 2b) 
2b+r+s, (w — 1/v], 


2, in (6) is the hypergeometric function. It has been noted that 
g(w; a, b, \) has a mode at a value of w between 0 and 1, a median at 


| 
| 
| 
(4) 
(5) 
(6) 
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w = 1 and that lim g(w) = 0asw—0andasw— o. Further, we 
use the fact that 


g(w; a, b, A) = g(1/w; a, b, d). 


We have shown that the distribution function 


(7) 


G(wo ; a, b, = P(w < b, = [ g(w; a, b, ») dw 
0 
may be approximated by the distribution function, 
G(wo; a’, b, 0) = [B(a’, [ + + 


(8) 
[4 _ = Nb = _ 
a + 1)(1 + wy) © 2a’ + 2)(1 + woy)” 


where 


a’ = (a + d)*/(a + 2d). (9) 


The suitability of this approximation was determined on the basis of 
comparing moments of g(w; a, b, \) and of g(w; a’, b, 0) and a limited 
number of values of G(w» ; a, b, A) and G(w, ; a’, b, 0). 

A table of values of wo(.05) for which G[wo(.05); a, b, 0] = .95 was 
prepared by Schumann and Bradley [1957, Table I] for integer and 
half-integer values of a, a = 1, --- , 21/2, 15 and for integer values 
of b, b = 1, --- , 10, 15, ~. This table is reproduced here as Table 1 
for the convenience of the reader. The table is essentially triangular 
for G(wo ; a, b, 0) = G(w. ; b, a, 0). It is apparent from the table that 
values of G(wo ; a, b, X) and G(w, ; a’, b, 0) are fairly stable even for 
considerable variation in values of \ and for quite small values of a 
and b. This implies that it will have little effect in applications if we 
enter the table using a value of \ somewhat different from its true 
(and usually unknown) value. Linear interpolation in Table 1 will 
be satisfactory in applications. 

In usual multiple regression theory with non-stochastic independent 
variables, if we fit two regression equations of the form 


y=at bat + bz, (10) 


to two independent sets of data and obtain corresponding multiple 


correlation coefficients, R, and R, , based on N observation vectors 
each, 


F, = bRi/a(l — 7=1,2 (11) 
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have distributions of the form (1) with 
Ay = (a + d)pi/(1 — 
a = p/2 (12) 
b = (N — p— 1)/2 


where p; , 7 = 1, 2, is the population multiple correlation coefficient 
corresponding to R; . It then follows that the procedures developed 
apply here and that we may perform tests of hypotheses on multiple 
correlation coefficients dependent on those procedures. It is common 
practice to use R’ as a measure of the fraction of the variability in the 
dependent variable explained by regression. It will be useful in many 
cases to be able to test hypotheses on the equality of pj and p; . 

Note that the application to comparing multiple correlation co- 
efficients is suitable for the regression model (10) but does not apply 
to correlations based on models assuming a multivariate normal distri- 
bution for vectors (y, x, , --+ , x,), for then the distribution (1) is not 
correct. 


TEST PROCEDURES 


(t) Tests of hypotheses on sensitivity 

In tests of sensitivity we shall use the statistic w = F,/F, defined 
in (3) and on the assumptions stated earlier that F, and F, are in- 
dependent, have 2a and 2b degrees of freedom, and have noncentral 
F-distributions with parameters \, and A, . wW(a) is defined to be 
such that G[w,.(a); a, b, 4) = 1 — a. Equality of \, and implies 
equal sensitivities, for the two experiments so long as k in (2) is the 
same for each experiment, that is, so long as treatment means are 
based on equal numbers of replications. 


Test i. H, = = A, > (13) 


The null hypothesis H, represents the hypothesis of equal sensitivities 
for the two identical experiments. The alternative hypothesis H,, 
assumes that we are interested in the one-sided test perhaps because 
we shall not change to the method of Experiment 1 from the currently 
used method of Experiment 2 unless the former is more sensitive. 
Given a, we obtain wo(a) and reject H, in favor of H., at significance 
level a when w > w,(a). 


Test 2. H, ™ = A, < (14) 


This test is identical with Test 1 when the identities of the experiments 
are interchanged. 


: j 
) 
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Test 3. H, = r, re (15) 


The two-sided test is of course similar to Tests 1 and 2 and would 
be used when departures from H, in either direction are of interest. 
For significance level 2a, reject Hy in favor of H,, when w > w(a) 
or when 1/w > w,(a). 

(it) Tests of hypotheses on correlation 


Testt4. Ho:pi=o, (16) 


Test5. Ho:pi=p:, (17) 


Testt6. Ha: pi (18) 


Tests 4, 5, and 6 may be associated with Tests 1, 2, and 3 respectively 
when we redefine 


w = Ri(1 — R2)/Rx(1 — Ri) (19) 


with the association of the parameters in (12). With this new definition 
of w, the test procedures are exactly those used in (z) above for tests 
of hypotheses on sensitivity. It is of course necessary that (a + b) 
in (12) be constant for the two regression experiments. 

(tit) Analysis 

The following easy basic steps are required in the tests outlined 
above. 

1. Do analyses of variance for both experiments yielding F-statistics 
valid for treatment comparisons, each with 2a and 2b degrees of freedom. 
[When analysis of variance for multiple regression is used, F = 
bR?/a(1 — R?)]. 

2. Estimate the parameters \, and X, pertaining to the two experi- 
ments. 


(a) In Model I of the analysis of variance, estimate \ for each 


experiment by taking 
2 
i= of ~ 1] (20) 


e 


where s? and s% are respectively mean squares for treatments and for 
error. The mean of 4, and }, is heuristically taken to be the “best 
available” single estimate* of the population parameter X. 

(b) In multiple regression, Snedecor [1950, p. 348] gives 


Ri =1-(1— d)/b (21) 


*The conservative statistician might take { to be the smaller of { and {2 ; the ultra-conservative 


statistician might take } = 0. For moderately large a and b, wo(a) does not change much with changes 
in A. 


| 
| 


504 BIOMETRICS, DECEMBER 1957 


in our notation and R% may be taken as an estimate of p’ required 
to estimate \ in (12). This is however equivalent to taking 


2 
af 1] (22) 
similar to (20) where s?,, and s°,, are respectively mean squares for 
regression and residual (error) variation in the analysis of variance 
for regression. Then again estimates of \, and \, are obtained from 
the two experiments and their mean is taken as 4. 

3. Formulate the null hypothesis H, : \,; = 2 = A, using X for 2, 
and the suitable alternative hypothesis selected from Tests 1-6. 

4. Decide on the significance level to be used, a for a one-sided 
test and 2a for a two-sided test.* 

5. Calculate w = F,/F, . 

6. Compute a’ = (a + 4)*/(a + 2h) using i obtained in Step 2. 

7. Obtain wo(a) from Table 1, possible when a = .05, by entering 
that table with a’ for a and with b and by using linear interpolation 
in the table. 

8. Compare w with w,(a) rejecting H, in favor of H, in accordance 
with the rules set forth for Tests 1-6. 


NUMERICAL EXAMPLES 


The following numerical examples have been chosen to illustrate 
applications of the method of comparing sensitivities of identical 
experiments in different experimental situations. In summarizing the 
analyses we follow the steps outlined in the preceding section. 

Example 1. Discriminating Abilities of Judges. 

Two judges were asked to score desirability of color of nine brands 
of canned tomatoes in two independent randomized block experiments. 
The discrete scoring scale employed had values ranging in units of 5 
from 0 to 100. Although the scale was discrete, it will be assumed that 


TABLE 2 
AVERAGE CoLor Scores FOR CANNED TOMATOES 


Brand 


Judge A (Exp. 1) 
Judge B (Exp. 2) 5. 


*At the present time we are limited to taking a = .05 due to the unavailability of other tables. 


| 2 5 9 3 
5 85.0 78.7 75.0 
0 68.7 65.0 60.0 

Brand 1 7 4 8 6 
Judge A 61.3 50.0 48.7 48.7 47.5 
Judge B 52.5 45.0 40.0 40.0 38.7 
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analysis of variance applies and the observed scores were used in such 
analyses. Judge A was to be used on a scoring panel if he demonstrated 
sensitivity to color differences as well as the experienced Judge B. 

The nine brands of canned tomatoes with the average scores accorded 
each, averaged over four replications, for each judge are given in 
Table 2. 

The analysis for comparison of the sensitivities (discriminating 
abilities) of the two judges now follows: 

1. Analyses of variance are given in Table 3. F, = 19.14 and 
F, = 5.42 while 2a = 8, 2b = 24,a = 4,b = 12. 


TABLE 3 
ANALYSES OF VARIANCE FOR THE Two JUDGES 


Source df. Mean Square 


Judge A (Experiment 1) 
Blocks 3 167.6 
Treatments 8 1134.1 
Error 24 59.3 


Judge B (Experiment 2) 


Blocks 3 346.3 
Treatments 8 761.6 


Error 24 140.6 


**Significant at the .01 level of significance. 


2. We use method (a), equation (20). 


1134.1 
i, = 11341 zs 72.5. 


761.6 
1, = 1] = 17.7. 

We take {} = 3(17.7 + 72.5) = 45.1. 

3. The hypotheses are Hy : \, = A. = A, Ha, : A, > Az as described 
in Test 1. 

4, It is specified that a = .05. 

5. w = 19.14/5.42 = 3.53. 

6. a’ = (4 + 45.1)?/(4 + 90.2) = 25.6 from (9). 

7. Wo(.05) < 2.87, obvious from Table 1 with a’ 
although a’ exceeds the tabled values of a in Table 1. 

8. H, is rejected in favor of H,, on the kasis of the test for 
w > w,(.05). This implies that Judge A is a good judge of color differ- 
ences even in comparison with experienced Judge B and Judge A may 
be included on the scoring panel. 


a 
{ 
5.42** 
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Note that if we had taken \ = 4, = 17.7, we would have obtained 
a’ = (4 + 17.7)°/(4 + 35.4) = 12.0 and w(.05) = 2.60. Our con- 
clusions would of course be unchanged. 

Examination of Table 2 suggests that the two judges are not using 
the scoring scale in quite the same way but that the difference is largely 
due to location and scale-unit differences. There is no evidence of 
judge by treatment interaction of the sort that would make interpre- 
tation of the w-test used here difficult. In most cases it would be neces- 
sary to train Judge A in use of the scoring scale consistent with that 
of the other panel members before he may be included on the panel. 

Example 2. Sensitivity of Pasture Yields to Irrigation. 

The yields of a mixture of Orchard Grass and Ladino Clover sub- 
jected to eight different treatments (eight fertilizers) were obtained 
from plots which were either irrigated or not irrigated. The experiments 
relating to irrigation and no irrigation were independent and each 
contained four replications. The average yields in pounds per acre 
for treatments A to H, averaged over the four replications, are given 
in Table 4. An objective was to learn whether the effects of differences 
in fertilization would be more or less pronounced under irrigation 
than under no irrigation. 

Many irrigation experiments of the kind described have been 
conducted at the Virginia Agricultural Experiment Station and little 
treatment by irrigation interaction has been obtained that is of a 
fundamental nature (that is, that cannot be explained by a shift in 
treatment locations and a magnification of treatment differences due 
to irrigation). If, for example, we plot pairs of observations in columns 
of Table 4, we find the points so obtained very nearly collinear and this 
is a crude but adequate check on the explanation of the treatment by 
irrigation interaction. 

Sometimes split-plot designs are used in this sort of irrigation 
experiment, but this is not usually acceptable since error variances for 
irrigated plots are often eight to ten times as large (only 2.7 times as 
large in our example) as for plots without irrigation. This same hetero- 
geneity of error variances also largely precludes the possibility of 


TABLE 4 


YIELDs oF OrcHARD GRASS AND LADINO CLOVER IN Las/AcRE UNDER 
FERTILIZER TREATMENTS 


Treatment A B C D E F G H 


No irrigation (Exp. 1) 499 627 642 756 884 494 597 377 
Irrigation (Exp. 2) 831 836 931 1137 1346 730 903 583 


a 
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looking at tables of means such as Table 4 in order to compare the 
sensitivities of the two growing conditions to treatment differences. 
The split-plot design is not necessary (whole plot treatments were 
“irrigation” and “no irrigation”) for in most experiments in most 
years huge differences between irrigation and no irrigation are found. 

The analysis for comparison of the sensitivities of no-irrigation and 
irrigation experiments to differences in fertilizer treatments is now set 
out. 


1. F, = 8.86, F, = 7.10, a = 3.5, and b = 10.5 from analyses of 
variance in Table 5. 

2. 4, = 3.5(8.86 — 1) = 27.5 andi, ="3.5(7.10 — 1) = 21.4 and we 
take } = 3(27.5 + 21.4) = 24.4 using (20) and Step 2. 


3. Hp = A. = A and H,, : \, were chosen in accordance 
with the objective of the experiments. 


4. We take a = .05 and the significance level is .10. 

5. w = 8.86/7.10 = 1.25. 

6. a’ = (3.5 + 24.4)?/(3.5 + 48.8) = 14.9 from (9). 

7. We use the result that G(w, ; a, b, 0) = G(w, ; b, a, 0) and enter 
Table 1 with a = 10.5 (in our experiments b = 10.5) and with b = 14.9 
(we obtained a’ = 14.9). Then we find that 2.36 < w)(.05) < 2.63. 


TABLE 5 
ANALYSES OF VARIANCE FOR THE Two FERTILIZER EXPERIMENTS 


Source df. Mean Square F 


Experiment 1—No Irrigation 


Replications 3 17 ,899 
Treatments 102,104 8.86** 
Error 21 11,517 


Experiment 2—Irrigation 


Replications 3 93,155 
Treatments 7 224 , 462 7.10°* 
Error 21 31,599 


**Significant at the .01 level of significance. 


8. w is not significant in comparison with w,(.05) nor is 1/w. It 
does not appear that these irrigation and no-irrigation experiments 
differ much in sensitivity to fertilizer treatment effects. (While we 
took, \ = 24.4, the same conclusion would have been reached for other 
values of \ in a fairly wide interval about 24.4.) 

Example 3. A Comparison of Two Multiple Correlation Coefficients. 

In research in chemical engineering on an extraction mechanism, 
the rate of transfer of one component of a two-component, two-phase 
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system (water and a solvent) across the interface was measured.* 
This rate of transfer was correlated with the physical properties of the 
phases. For any one system, log (k,d/D,,) or y was assumed to be 
linearly related to log (G,d/u,) or x, and log (G,,d/u,,) or x2 as a result 
of a multiplicative model based on theoretical knowledge of the mech- 
anism. The symbols used are: 


The film coefficient for transfer of water into the solvent 
phase, lb. moles per hr., sq. ft. unit concentration difference. 
: The diameter of the extraction tube, ft. 

: The diffusivity of water into solvent, sq. ft. per hr. 

: The mass velocity of the solvent phase, lb. per hr., sq. ft. 

: The mass velocity of the water phase, lb. per hr., sq. ft. 
The viscosity of the solvent phase, Ib. per ft., hr. 

: The viscosity of the water phase, lb. per ft., hr. 


For this example, we consider regression equations of the form 


y= at b,x, + dot, 


for two solvents, cyclohexanol and methyl ethyl ketone. The inde- 
pendent variables, x, and z, , are measurable with small errors that are 
insignificant in comparison with the residual variability of y. Ac- 
cordingly, the regression equations and the resulting estimates of the 


multiple correlation coefficients are appropriate to illustrate the com- 
parison described in this paper. We are interested in comparing the 
dependencies of y, essentially a transfer rate, on the physical properties 
of the phases for the two different solvents. The experimental results 
and the statistical analysis are now set forth. 

The regression equations are: 

1. Cyclohexanol, 


y = 1.0986 + 1.0931z, + 0.17362, 


and 
2. Methyl ethyl ketone, 


y = —2.8715 + 0.87127, + 0.52342, . 


The corresponding squared multiple correlation coefficients are 
R? = 0.571 and R} = 0.486 with 2 and 34 and 2 and 31 degrees of 
freedom respectively. 


The two regression equations look rather different but the objective 


*We are indebted to Prof N. F. Murphy of the Department of Chemical Engineering, Virginia 
Polytechnic Institute, for permission to use his data and for advice on the presentation of this example. 
The regression equations in this example were calculated from work on liquid-liquid extraction given 
by Murphy, Lastovica, and Skrzec [1956] which contains similar, but somewhat more complex, re- 
gression analyses. 


k, 
o 
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is simply to compare values of p’ as measures of the effectiveness of 
the regressions. This can be done here since the degrees of freedom are 
almost the same for the two studies and this is required in view of the 
definition of \ in (12). 

The analysis is: 

1. R?/(1 — R?) = 1.3310, R3/(1 — R,)* = 0.9455 with a = 1 and 
b, = 17, b, = 15.5. We take 6 = (17 + 15.5)/2 = 16.25, the average 
of the values of b for R? and R?. 


2. Using Step 2(b), formula (21), we have 
Ai = 1 — (1 — 0.571)(1 + 17)/17 = 0.546, 
p2 = 1 — (1 — 0.486)(1 + 15.5)/15.5 = 0.453. 
Then, from (12), 


4, = (1 + 17)(0.546)/(1 — 0.546) = 21.647 
and 


he = (1 + 15.5)(0.453)/(1 — 0.453) = 13.665. 


We take 


= (21.647 + 13.665) = 17.66. 


. The hypotheses are Hy : pi = pz and H,, : pi 
. a = 0.05; the significance level is 2a = 0.10. 
" 1.3310/0.9455 = 1.408 from (19). 
(1 + 17.66)?/[1 + 2(17.66)] = 9.59. 
is evident from Table 1 that w,(.05) ~ 2.67 and certainly 
w,(.05) > 2.17 when we enter that table with a = 9.59 and b = 6 = 16.25. 
8. The observed value of w, 1.408, is exceeded by w(.05) > 2.17 
and we do not reject Hy : pj = p; formulated in Step 3 above. 


DISCUSSION AND SUMMARY 


We have illustrated a method of comparing the sensitivities of two 
identically designed experiments and of comparing the squares of two 
multiple correlation coefficients from comparable regression analyses. 
In each case, the problem reduced to a test of the hypothesis of equality 
of two parameters of noncentrality in two noncentral F-distributions. 
This is satisfactory when the number of replications in each treatment 
mean is the same in each analysis of variance and when each regression 
equation is based on the same number of observations. 

Examples where these methods may be used have been pointed out 
in a general way in a number of areas of research. Three particular 
numerical examples are given illustrating the application of the steps 
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for analysis specifically set forth. Some discussion of the care needed 
in interpreting these analyses is given and in particular it is pointed 
out that treatment by experiment interactions of a fundamental nature 
may lead to difficulties of interpretation. The table available for the 
tests of significance required in analyses is somewhat limited (this is 
indicated from the way the examples are handled) but Schumann is 
planning to extend these tables and to prepare tables for values of a, 
the significance level in one-sided tests, of .025, 0.01, and .005. Neverthe- 
less, the available table with a = .05 does provide a means of making 
the tests that are often desired in experimental work. 

There are some situations wherein the sensitivities of experiments 
that do not meet the requirements on similarity set forth should be 
compared. The practical situations seem to be where the same treat- 
ments are in both experiments but where the experiments are based 
on different numbers of replications or on somewhat different experi- 
mental designs. A crude method of obtaining an approximation to 
the critical value of w in that situation based on the use of Table 1 has 
been suggested by Schumann and Bradley [1957] but its accuracy is 
not known and it is not repeated here. 

In conclusion, D. E. W. Schumann wishes to express his appreciation 
of a grant from the South African Council of Industrial and Scientific 
Research, Pretoria, South Africa, and for assistance from Fulbright 
and U.S. Department of State grants. These grants and the assistance 
of the Agricultural Research Service, U.S.D.A., made this work possible. 

We are grateful to Mrs. Sally Arnold, who did much of the computing, 
and to Mrs. E. P. Ellison, Miss Marianne Byrd, and Mrs. J. D. Hall for 
clerical assistance. 
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ERROR RATES AND SAMPLE SIZES FOR RANGE TESTS IN 
MULTIPLE COMPARISONS 


H. Leon Harter 


Wright Air Development Center 
Wright-Patterson Air Force Base, Ohio, U. S. A. 


1. Introduction 


The problem of multiple comparisons has recently aroused a great 
deal of interest among statisticians. The basic F-test in an analysis 
of variance determines whether there is a significant difference among 
a group of means, but it cannot tell which means differ significantly 
from which others. The latter is often what the investigator really 
wants to know. Various multiple comparisons tests, including the 
range tests discussed in this paper, have been proposed. A study is 
made here of the error rates, a and @, and their relation to sample size, 
N, for three fixed range tests and three multiple range tests. 

Let it be required to test the significance of the range of p out of 
m ordered means of samples of size N, where p = 2,3, --- ,m. For the 
fixed range tests, the critical range for a particular Type I error rate, 
a, depends only on m and N; for the multiple range tests, it depends 
also on p. In fact, for fixed m and N, it is a non-decreasing function of 
p. The range of p means is said to be significant or non-significant 
(at the @ level) according as it does or does not exceed the critical range, 
except that the range of p means is automatically non-significant if 
these p means constitute a subgroup of a larger group whose range is 
non-significant. Because of this exception, a multiple range test starts 
with a test on all m means. If the range of all m means is found to be 
significant, then tests are performed on the ranges of (m —1) successive 
means, (m —2) successive means, and so on until significant differences 
are no longer found. 


2. Significance vs. Confidence 


Though the emphasis here is upon tests of significance of size a 
for the difference between sample means, each of the above methods 
can also be used for setting 100 (1 —a)% confidence limits on the 
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difference between population means. The choice between significance 
tests and confidence limits depends upon the experimental situation 
and upon the experimenter’s philosophy. Many statisticians, in- 
cluding Tukey [1953], prefer confidence procedures in practically all 
cases, and the author is convinced that there is much merit in this 
position. The results of this paper, however, will be stated in terms 
of significance. The reader who prefers confidence procedures can 
easily translate from the language of significance to that of confidence. 


3. The Fixed Range Tests 


The three fixed range tests which will be considered are the least 
significant difference (LSD) test based upon the work of “Student” 
[1908], a modification of this proposed by Fisher [1935], and the student- 
ized range test proposed by Tukey [1952]. They will be designated 
by code letters L, E and W respectively. 

The differences LSD, WSD and ESD between any two out of m 
means of samples of size N required for significance (at the a level) 
by the LSD test, Tukey’s studentized range test, and Fisher’s test, 
respectively, are given by 


LSD = t(a, 2, (1) 
WSD = qa, m, »)sz , (2) 
ESD = t(@/C? , »)si = g(a/C? , 2, , (3) 


where v is the number of degrees of freedom for the error mean square, 
s’ ; t(a, v) is the 2-tailed a point (upper a/2 point) of Student’s ¢ with 
v degrees of freedom; g(a, 7, v) is the upper a point of the studentized 
range of r observations with v degrees of freedom for s; sz is the standard 
error of the mean, s; = ~/s’/N; and sj is the standard error of the 
difference between means, sz = V/2s'/N = 

Each of these three test procedures has Type I error rate a for an 
appropriate definition of the error rate. As a result of a test on the 
means of m samples each of size N, a statement of significance or non- 
significance will be made about each of the C’} differences between pairs 
of means. A wrong statement (Type I error) is a statement that two 
means are significantly different, when actually they are means of 
random samples from the same population. The appropriate error 
rates a, , aw , and az for the respective test types are then defined as 
follows: a, is the expected proportion of wrong statements (error 
rate per comparison); aw is the expected proportion of experiments 
with one or more wrong statements (familywise error rate); and a; 
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is the expected number of wrong statements per experiment (error 
rate per family). 

In practice, many statisticians (including the author) use the 
protected LSD test, in which the individual comparisons are made 
only if an analysis of variance yields a significant F-ratio for the com- 
parison of all m means. The Type I error rate, a, , which will be con- 
sidered here, however, is that for the unprotected LSD test, otherwise 
known as the multiple ¢-test. 


4. The Multiple Range Tests 


The three multiple range tests which will be considered are the 
Newman-Keuls test, Tukey’s X procedure, and Duncan’s new multiple 
range test. They will be designated by the code letters J, X and S, 
respectively. 

The Newman-Keuls test was first proposed by Newman [1939] 
and modified slightly by Keuls [1952]. The critical range of p out of 
m ordered means of samples each of size N is based on the studentized 
range of p observations instead of on the studentized range of m obser- 
vations as in Tukey’s studentized range test. It can easily be seen 
that the critical range ISD for the Newman-Keuls test is intermediate 
between the LSD and the WSD, except that it is the same as the LSD 
for p = 2 and the same as the WSD for p = m. 

Tukey’s X procedure was proposed by Tukey [1953] as a com- 
promise between the Newman-Keuls test and Tukey’s studentized 
range test. The critical range is the arithmetic mean of the critical 
ranges for those two tests. For fixed m and N, the smallest critical 
range (a value midway between the LSD and the WSD) occurs for 
p = 2, and the largest critical range (equal to the WSD) occurs for 
p= mM. 

Duncan’s new multiple range test, proposed by Duncan [1953, 
1955], makes use of special protection levels based upon degrees of 
freedom. Let y2,. = 1 —a be the protection level for testing the 
significance of difference between two means, that is, the probability 
that a significant difference between sample means will not be found 
if the population means are equal. Duncan reasons that one has (p —1) 
degrees of freedom for testing p means, and hence one may make 
(p —1) independent tests, each with protection level y.,. . Hence 
the joint protection level is 


= = (1 a)"; (4) 


that is, the probability that one finds no significant differences in 
making (p —1) independent tests, each at protection level y2,. , is 


; 
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3-2. Thus Duncan’s new multiple range test is based upon protection 
levels y»,. for tests on p means. The critical range SSD is a minimum 
(equal to the LSD) for p = 2, and a maximum for p = m. 

The critical ranges ISD, XSD, and SSD for p out of m ordered means 
of samples of size N required for significance (at the a level) by the 
Newman-Keuls test, Tukey’s X procedure, and Duncan’s new multiple 
range test, respectively, are given by 


ISD = qa, p, »)sz , (5) 
XSD = (1/2)[q(a, P; + qa, m, v) |sz (6) 
SSD = R(p, V; (7) 


where R(p, v, ¥p,2) is the studentized range of p observations with v 
degrees of freedom for s and protection level y,,. given by equation 
(4), and where the other symbols are defined as in the previous section. 


5. Example 


At this point an example will serve to illustrate the various test 
procedures. For the sake of simplicity, a single-classification experi- 
ment involving comparisons of five means will be considered. The 
data represent the weight loss (grams) in four days of rats on five diets 
(no food and 8 calories per rat per day of four foods). In order to 
reduce experimental error, only male rats were used, all rats were of 
the same strain, and their initial weights were nearly constant. All 
one hundred rats received water ad libitum, but they were randomly 
divided into five groups of twenty rats each, one group on each of the 
five diets. 

An analysis of variance of the data yields the following results: 


Source of Variation Di. S.s. M.s. F 
Among diets 4 3012.70 753.18 38. 14** 
Between diets (error) 95 1876.30 19.75 

Total 99 4889 .00 


Thus there is a highly significant difference among mean weight 
losses of the rats on the five diets. This information is quite interesting 
to the experimenter, but what he really wishes to know is which diets 
are significantly better than which others. In order to obtain this 
information, it is necessary to apply a multiple comparisons test to the 


means: 


| 
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Purina Laboratory Chow Casein Dextrose Olive Oil No Food 
23.70 24.90 28.25 29.40 39.25 


The standard error of the mean, s; , and the standard error of the 
difference between means, sz , are found to be 0.994 and 1.405, re- 
spectively, based upon the internal estimate of variance, s’> = 19.75, 
with 95 degrees of freedom. The factors by which these must be 
multiplied to obtain the critical ranges for significance at the 5% and 
1% levels can be found in tables. Tables of the critical values of the 
Student ¢ distribution can be found in almost any statistics textbook. 
The 5% and 1% points of the studentized range have been tabulated 
by Pearson and Hartley [1954], and the 5% and 1% points of 
R(p, v, Y>.«) by Beyer [1953] and Duncan [1955]. All of these tables 
have been reproduced by Federer [1955], who gives the only textbook 
treatment of the subject of multiple comparisons of which the author is 
aware. The factors by which s, = 0.994 must be multiplied to obtain the 
critical ranges for the various tests are as follows: 


Code a=0.05 »=95 a =0.01 
Letter 

L 2.81 2.81 2.81 2.81 | 3.72 3.72 3.72 3.72 
S 2.81 2.96 3.06 3.12 | 3.72 3.87 3.98 4.07 
I 2.81 3.37 3.70 3.94 | 3.72 4.22 4.53 4.74 
x 3.38 3.66 3.82 3.94 | 4.23 4.48 4.64 4.74 
W | 3.94 3.94 3.94 3.94 | 4.74 4.74 4.74 4.74 
E 4.05 4.05 4.05 4.05 | 4.80 4.80 4.80 4.80 


Multiplying the standard error of the mean s; = 0.994 by these factors, 
one finds that the critical ranges for the various tests are as follows: 


a = 0.05 a = 0.01 

Code 

Letter | p=2 p=3 p=4 p=5|p=2 p=3 p=4 p=5 
L 2.79 2.79 2.79 2.79 3.70 3.70 3.70 3.70 
Ss 2.79 2.94 3.04 3.10 3.70 3.85 3.96 4.05 
I 2.79 3.35 3.68 3.92 3.70 4.19 4.50 4.71 
X 3.36 3.64 3.80 3.92 4.20 4.45 4.61 4.71 
Ww 3.92 3.92 3.92 3.92 4.71 4.71 4.71 4.71 
E 4.03 4.03 4.03 4.03 4.77 4.77 4.77 4.77 
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Then the results of the various tests can be summarized as follows, 
where two means are or are not significantly different at the level a 
according as they are not or are underscored by the same line: 


Test a = 0.05 a =0.01 


LSD | 23.70 24.90 28.25 29.40 39.25] 23.70 24.90 28.25 29.40 39.25 
SSD | 23.70 24.90 28.25 29.40 39.25] 23.70 24.90 28.25 29.40 39.25 
ISD | 23.70 24.90 28.25 29.40 39.25] 23.70 24.90 28.25 29.40 39.25 
XSD | 23.70 24.90 28.25 29.40 39.25] 23.70 24.90 28.25 29.40 39.25 
WSD] 23.70 24.90 28.25 29.40 39.25] 23.70 24.90 28.25 29.40 39.25 
ESD | 23.70 24.90 28.25 29.40 39.25] 23.70 24.90 28.25 29.40 39.25 


Thus, at the 5% level, the only disagreement is as to whether the 
means for casein and dextrose differ significantly. At the 1% level, 
there are disagreements in regard to the following differences: (1) 
casein and olive oil and (2) Purina laboratory chow and dextrose. For 
experiments involving a larger number of means, one would expect a 
greater number of disagreements. 


6. Type I Error Rates 


The Type I error rate, a, is defined differently for the various tests. 
The Type I error rate (as defined for a particular test) is denoted by 
a with the code letter for that test as a subscript; for example, a, is 
the Type I error rate as defined by the LSD test. A comparison of the 
error rates is in order. 

By comparison of (1) with (2), one sees that a WSD with Type I 
error rate aw is equivalent to an LSD with Type I error rate a, , where 


tar ,v) = , 2, »)/VW2 = , m,»)/V2. (8) 


Similarly, by comparison of (1) with (3), one sees that an ESD with 
Type I error rate ag is equivalent to an LSD with Type I error rate 
a, , where 


a, = az/C?. (9) 


Let a, | ay be the value of a, corresponding to a given ay , where 
M may be any one of the letters W, E, J, X, or S. Table 1A gives 
a, | aw corresponding to aw = 0.05, 0.01 for m = 2(1)10(2)20, and 
N = 2, 8, 4, 6, 10, 16, 25, ~; also a, | ag corresponding to ag = 0.05, 
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0.01 for the same values of m. (The relation between a, and ag is 
independent of N.) This table, as well as others which will follow, 
has been computed for a single-classification design, and the results 
are slightly different for randomized blocks or other experimental designs, 
due to a slight reduction in the number of degrees of freedom. 

In testing the significance of the range of p out of m ordered means 
of samples of size N by one of the multiple range tests with Type I error 
rate a (as defined for that particular test), the critical range is a function 
of m, N and p. If a, mand N are fixed, the critical range is a function 
of p only. Each critical range, for fixed m and N, corresponds to a 
particular value of the Type I error rate a, ; the larger the critical range, 
the smaller the value of a, to which it corresponds. Since the critical 
range is a non-decreasing function of p, the maximum value of a, 
corresponding to a fixed a for the multiple range test under consideration, 
with m and WN also fixed, occurs for p = 2, and the minimum for p = m. 

For p = 2, the Newman-Keuls test and Duncan’s new multiple 
range test have critical ranges equal to the LSD; hence (az) max | ar is 
equal to a, and (az) max | @s is equal to as . 

For p = m, the Newman-Keuls test and Tukey’s X procedure 
have critical ranges equal to the WSD; hence (az) mia | a is equal to 
a, | aw = and (az) nin | ax is equal to a, | aw = ax. The values of 
az, | aw = 0.05, 0.01 have been given in Table 1A. 


Table 1B gives (az) max | ax corresponding to ax = 0.05, 0.01 and 
Table 1C gives (az) min | @s corresponding to as = 0.05, 0.01, for the 
same values of m and WN as in Table 1A. 


7. Type II and Type III Error Rates 


Let my and m, be the two extreme values of p out of m ordered 
means of samples of size N. Let uy and yw, be the population means 
corresponding to my and m, respectively. Let the subscripts U and 
L be so assigned that py > uz. (This does not necessarily imply that 
My > m,.) It should be noted that, while exactly (p —2) of the 
(m —2) other sample means lie between my and m, , the number of 
other population means lying between wy and uw, may be less than, 
equal to, or greater than (p —2). Now define 6 and 6’ by the relations 


5 = (uy — (10) 
5’ = (uv — (10) 


where ¢ is the common population standard deviation and s is the 
pooled estimate obtained from the data. 


If the conclusion is drawn that my and my, are not significantly 
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different when actually uy > uz , this constitutes an error of Type II. 
If m, is found to be significantly greater than my when actually uy > uz, 
this will be called an error of Type III. Figure 1 shows how such an 


TABLE 1 


CoMPARISON OF TyPE I Error RATES FOR SINGLE CLASSIFICATION 
A. Values of a, Corresponding to aw = 0.05, 0.01 and to ag = 0.05, 0.01 


ay, | 
ay, | aw = 0.05 = 0.05 

m 
N=2N=3N=4N=6N=10 N=16 N=25 (all N’s) 
2 .05000 .05000 .05000 .05000 .05000 .05000 .05000 .05000)} .05000 
3 | .02506 .02197 .02093 .02028 .01964 .01943 .01923 .01911) .01667 
4] .01517 .01254 .01170 .01104 .01070 .01040 .01032 .01023) .00833 
5 | .01022 .00818 .00746 .00699 .00670 .00654 .00646 .00634; .00500 
6 | .00727 .00569 .00524 .00486 .00465 .00450 .00444 .00438) .00333 
7 | .00542 .00418 .00382 .00356 .00341 .00331 .00327 .00319) .00238 
8 | .00418 .00319 .00294 .00272 .00260 .00255 .00249 .00242) .00179 
9} .00330 .00251 .00231 .00216 .00202 .00195 .00194 .00191] .00139 
10 | .00269 .00204 .00185 .00173 .00165 .00162 .00161 .00157) .00111 
12 |} .00184 .00141 .00129 .00119 .00114 .00112 .00111 .00109) .00076 
14 | .00135 .00103 .00094 .00089 .00085 .00084 .00082 .00080) .00055 
16 | .00103 .00079 .00072 .00067 .00065 .00064 .00063 .00062) .00042 
18 | .00080 .00062 .00057 .00053 .00051 .00050 .00050 .00049) .00033 
20 | .00064 .00050 .00047 .00043 .00041 .00041 .00040 .00040) .00026 
az, | aw = 0.01 az | ag 
m = 0.01 
N=2N=3N=4N=6N=10 N=16 N =25 N- (all N’s) 
2} .01000 .01000 .01000 .01000 .01000 .01000 .01000 .01000) .01000 
3 | .00497 .00420 .00397 .00383 .00372 .00364 .00360 .00358) .00333 
4 | .00291 .00233 .00215 .00201 .00192 .00188 .00187 .00186; .00167 
5 | .00191 .00146 .00133 .00125 .00119 .00117 .00116 .00114) .00100 
6 | .00134 .00101 .00092 .00085 .00081 .00080 .00079 .00076} .00067 
7 | .00098 .00073 .00066 .00062 .00058 .00057 .00057 .00056) .00048 
8 | .00074 .00055 .00050 .00047 .00044 .00043 .00043 .00042) .00036 
9} .00058 .00043 .00039 .00036 .00035 .00034 .00034 .00033) .00028 
10 | .00047 .00034 .00031 .00029 .00028 .00027 .00027 .00026) .00022 
12 | .00031 .00023 .00021 .00020 .00019 .00019 .00018 .00018) .00015 
14 | .00022 .00017 .00016 .00015 .00014 .00014 .00014 .00013) .00011 
16 | .00017 .00013 .00012 .00011 .00011 .00011 .00011 .00010} .00008 
18 | .00013 .00010 .00009 .00009 .00008 .00008 .00008 .00008} .00007 
20 | .00010 .00008 .00007 .00007 .00007 .00007 .00007 .00006} .00005 


i 
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error may arise. The sum of the Type II error rate, 8, and the Type 
III error rate, y, will be denoted by §’, that is, 


Bo =B+y. (11) 
The Type III error rate, y, may be sizeable when 6’ and N are both small. 
Otherwise, y = 0 and p’ = 8B. 


TABLE 1 (continued) 
B. Maximum Values of a; Corresponding to ax = 0.05, 0.01 


(az) max | ar = 0.05 


m 
N=2 N=3 N=4 N=6 N=10 N=16 N=25 Noo 
2 -0500 .0500 .0500 .0500 .0500 .0500 .0500 # .0500 
3 -.0348 .0329 .0323 .0321 .0316 .0316 .0314 .0314 
4 .0266 .0249 .0244 .0240 .0238 .0236 .0235 .0235 
5 .0217. .0202 .0197 .0194 .0192 .0191 .0191 .0191 
6 .0183 .0170 .0167 .0164 .0163 .0162 .0162 .0162 
7 .0158 .0148 .0145 .01438 .0142 .0142 .0142 .0142 
8 .0140 .0130 .0129 .0127 .0127 .0126 .0126 .0126 
9 .0125 .0117 .0116 .0115 .0114 .0113 .0114 
10 .0114 .0107 .0105 .0104 .0104 .0104 .0104 .0104 
12 .0096 .0091 .0090 .0090 .0089 .0089 .0089 .0090 
14 .0084 .0080 .0079 .0079 .0079 .0079 ~.0079 .0079 
16 .0074 .0071 .0071 .0071 .0071 .0071 
18 .0067 .0065 .0064 .0064 .0064 .0064 ~«.0065 # .0065 
20 .0061 .0059 .0059 .0059 .0059 .0059 .0059 .0059 

(ay) max | ex = 0.01 
m 


N=2 N=3 N=4 N=6 N=10 N=16 N=25 Noo 


.0100 .0100 .0100 .0100 .0100 .0100 .0100 .0100 
.0069 .0064 .0062 .0062 .0061 .0061 ~.0061 .0061 
-0052 .0048 .0046 .0045 .0045 .0045 .0045 .0045 


9 0023 0021 0021 0021 0021 0021 0021 0021 
10 0020 0019 0019 0019 0019 0019 0019 0019 
12 0017 0016 0016 0016 0016 0016 0016 = =.0016 
14 0015 0014 0014 0014 0014 0014 0014 0014 
16 0013 0012 0012 0012 0012 0012 0012 0012 
18 0011 0011 0011 0011 0011 0011 0011 0011 


[ 
2 
3 
4 
5 .0041 .0037 .0036 .0036 .0036 .0036 .0036 .0036 . 
6 .0034 .0031 .0031 .0030 .0030 .0030 .0030 .0030 
7 .0029 .0027 .0026 .0026 0026 .0026 .0026 .0026 7 
8 .0025 .0023 .0023 .0023 .0023 .0023 
20 .0010 .0010 .0010 .0010 .0010 .0010 .0010 .0010 
a 
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TABLE 1 (continued) 
C. Minimum Values of a; Corresponding to ag = 0.05, 0.01 


(az) min | as = 0.05 


N=2 N=3 N=4 N=6 N=10 N=16 N=25 Noo 


2 .0500 .0500 .0500 .0500 .0500 .0500 .0500 .0500 
3 .0500 .0446 .0425 .0411 .0400 .0394 .0392 .0389 
4 .0467 .0397 .0364 .0359 .0344 .0333 .0331 .0327 
5 .0424 .0357 .0334 .0319 .0305 .0299 .0295 .0289 
6 -0406 .0333 .0305 .0287 .0275 .0268 .0265 .0259 
7 .0379 .0310 .0283 .0266 .0255 .0249 .0246 .0241 
8 .0360 .0291 .0266 .0247 .0238 .0231 .0228 .0224 


9 -0345 .0276 .0251 .0232 .0222 .0218 .0216 .0212 
10 -0340 .0260 .0237 .0222 .0212 .0207 .0205 .0200 
12 .0308 .0239 .0217 .0203 .0193 .0189 .0186 .0182 
14 .0282 .0221 .0201 .0189 .0180 .0175 .0173 .0168 
16 .0263 .0208 .0189 .0177 .0169 .0165 .0163 .0159 
18 .0245 .0194 .0179 .0166 .0160 .0156 .0154 .0150 
20 .0234 .0186 .0170 .0159 .0151 .0147 .0145  .0141 


(az)min | as = 0.01 
m 


N=2 N=3 N=4 N=6 N=10 N=16N= N- 


2 .0100 .0100 .0100 .0100 .0100 .0100 .0100 .0100 
3 .0092 .0080 .0075 .0075 .0074 .0074 .0074 .0072 
4 .0082 .0066 .0062 .0061 .0061 .0060 .0059 .0058 
5 .0072 .0057 .0055 .0053 .0052 .0050 .0050 .0049 
6 .0063 .0050 .0049 .0047 .0046 .0045 .0044 .0043 
7 .0054 :0045 .0044 .0042 .0040 .0040 .0039 .0038 
8 .0048 .0042 .0041 .0038 .0036 .0036 .0035 .0034 


9 .0043 .0039 .0038 .0036 .0034 .0033 .0033 .0032 
12 .0035 .0033 .0031 .0029 .0027 .0027 .0026 .0026 
14 .0033 .0029 =.0028 .0026 .0025 .0024 .0024 .0023 
16 -0031 .0027 .0025 .0024 .0023 .0022 .0022 .0021 
18 .0030 .0025 .0024 .0022 .0021 .0021 .0020 .0019 
20 .0029 .0024 .0022 .0020 .0019 .0019 .0019 .0018 


The code letters for the various tests are used as subscripts on 8 
and 6’; for example, 8, denotes the Type II error rate for the LSD test. 

The Type II and Type III error rates, 6 and y, and their sum, §’, 
depend upon a, m, N and 4’. In the case of multiple range tests, they 
also depend upon 2; for a fixed combination of a, m, N and 4’, B’ isa 
minimum for p = 2 and a maximum for p = m. For p = 2, the New- 
man-Keuls test and Duncan’s new multiple range test have critical 
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ERROR TYPE PROBABILITY CODE 
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FIGURE 1 
GRAPHICAL REPRESENTATION OF THREE TypES OF ERROR 


ranges equal to the LSD; hence (6) min = (@3)min = BZ. For p = m, 
the Newman-Keuls test and Tukey’s X procedure have critical ranges 
equal to the WSD; hence (8/)max = (8x)max = By . Table 2 gives values 
of BZ , (B4)max » (8%)min , and . For convenience, these are denoted 
by 8{ , 83 , Bf , and 8{, respectively. Thus one has the following equa- 
tions and inequalities: 


bi = Bi, (12) 
Bi Bs SB, (13) 
SK, (14) 
BS Be SK, (15) 

w= (16) 


In (13), (14), and (15), the left-hand equality holds when p = 2 and 
the right-hand equality when p = m. Error rates, Bg , for Fisher’s 
test have not been computed for m > 2, but they are even larger than 
the corresponding Bf . Hence one has the inequality 


(17) 


where the equality holds only for m = 2, in which case all six tests are 
identical. 
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The values in Table 2, like those in Table 1, were computed for the 
single-classification case, and the values for randomized blocks and 
other designs are slightly different. 

No attempt has been made to specify in the hypothesis under test 
the values or relative values of population means other than yy and 
ut. This does not affect the fixed range tests, and leads to no error in 
the maximum values of 6’, but produces an upward bias in minimum 
values of 6’ for the multiple range tests. This is true since even if the 
range of two adjacent sample means is significant, the two will be said 
to differ significantly only if the ranges of all groups containing them 
are significant. Wine [1955] has made a power study of multiple range 
tests in which the relative values of all population means are specified. 
He tabulated the power for tests on 3 and 4 means. For larger values 
of m, his approach appears forbiddingly difficult, and the present 
approach, though not entirely satisfactory, seems to be the only feasible 
one. 


8. Required Sample Sizes 


If a, B, 5’ and m are fixed, the required sample size N for each of the 
fixed range tests can be determined. For multiple range tests, the 
required sample size N depends upon p. If a, 8, 6’ and m are fixed, 
the minimum required sample size N,,;, occurs for p = 2, and the 


maximum required sample size N,,,. occurs for p = m. 

The code letters for the various tests are used as subscripts on the 
required sample size N; for example, N, denotes the required sample 
size for the LSD test. 

For p = 2, the critical ranges for the Newman-Keuls test and 
Duncan’s new multiple range test are equal to the LSD; hence (Nz) min = 
(Ns)min = Nz. For p = m, the critical ranges for the Newman-Keuls 
test and Tukey’s X procedure are equal to the WSD; hence (Nz) max = 
(N = N W- 

Table 3 gives Nz , (Ns)max » (Nx)min , and Nw for a = 0.05, 0.01; 
B = 0.10, 0.05; m = 2(1)10(2)20; 6’ = 0.5(0.5)3.0. The required 
sample sizes Nz for Fisher’s test have not been computed, but they are 
slightly larger than the corresponding Ny . 

For small values of 6’, the required sample size N is approximately 
inversely proportional to 6”. Hence, for 6’ < 1, the approximate 
value of Né” may be read from the column headed 6’ = 1.0 in Table 3. 

The values in Table 3 have been computed for the single-classifica- 
tion case. To obtain sample sizes which will ensure that a and 6 do 
not exceed the prescribed values for randomized blocks, it can be 
shown that it is sufficient (though not always necessary) to add 2 to 
the sample sizes for m = 2 and to add 1 for m > 3. 


4 
2 
; 


TABLE 3 
Samp.e N ror PRESCRIBED a AND = 0.5 (0.5) 3.0] 
(SINGLE CLASSIFICATION) 
A. a = 0.05, 8 = 0.10 (P = 0.90) 


=10 =15 =20 =25 = 3.0 


8 


7 
7 
6 
6 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
7 
8 
7 
7 
7 
8 
8 
8 
8 
8 
8 
8 
9 
9 
9 
9 
7 
8 
9 
9 


m 
2 86 23 11 | 
3 85 22 10 
N, 4 85 22 10 
85 22 10 
86 23 ll 
91 24 ll 
94 24 11 
97 25 12 
99 25 12 
101 26 12 . 
102 26 12 
(Ns)max 9 104 26 12 
10 105 27 12 
12 107 27 12 
14 108 28 13 
16 110 28 13 
18 111 28 13 
20 112 28 13 
2 86 23 11 
3 96 25 12 . 
4 102 26 12 
5 106 27 13 
6 110 28 13 
7 113 29 13 
8 115 29 14 
(Nx)mio 9 117 30 14 
10 119 30 14 
12 122 31 14 
14 125 32 15 sae 
16 127 32 15 ip 
18 129 33 15 i 
20 131 33 15 a 
2 86 23 
3 106 28 13 
4 120 31 14 
5 130 33 15 
Nw 6 138 35 16 10 
7 144 37 17 10 
8 150 38 17 10 
9 155 39 18 ll 7 
10 159 40 18 11 
12 166 42 19 1l 
| 14 173 44 20 12 
| 16 178 45 20 12 7 
18 183 46 21 12 7 
20 187 47 21 12 7 
4 


TABLE 3 (continued) 
B.a = 0.05, 8 = 0.05 (P = 0.95) 


* =10 =15 =20 =25 8 = 3.0 
2 106 28 13 8 6 5 
3 105 27 13 - 6 4 
N, 4 105 27 13 8 5 4 
5 105 27 13 7 5 4 
6-20 105 27 12 7 5 4 
2 106 28 13 8 6 5 
3 111 29 13 8 6 4 
4 115 30 14 8 6 4 
5 119 30 14 8 6 4 
6 121 31 14 8 6 4 
7 122 31 14 8 6 4 
8 124 32 14 8 6 4 
(Ns)max 9 126 32 15 9 6 4 
10 127 32 15 9 6 4 
12 129 33 15 9 6 4 
14 131 33 15 9 6 4 
16 132 33 15 9 6 4 
18 134 34 15 9 6 4 
20 135 34 15 9 6 4 
2 106 28 13 8 6 5 
3 116 30 14 9 6 5 
4 123 32 15 9 6 5 
5 128 33 15 9 6 5 
6 132 34 16 9 6 5 
7 185 35 16 9 6 5 
8 138 35 16 9 6 5 
(Nx)min 9 140 36 16 10 6 5 
10 142 36 17 10 7 5 
12 146 | 37 17 10 7 5 
14 149 38 17 10 7 5 
16 151 38 17 10 7 5 
18 153 39 18 10 7 5 
20 155 39 18 10 7 5 
2 106 28 13 8 6 5 
3 129 33 16 9 7 5 
4 143 37 17 10 7 5 
5 154 39 18 ll 7 5 
Nw 6 163 41 19 11 8 6 
7 170 43 20 12 S 6 
. 176 45 20 12 8 6 
9 181 46 21 12 8 6 
10 186 47 21 12 8 6 
12 194 49 22 13 9 6 
14 201 51 23 13 9 6 
16 206 52 24 14 9 7 
18 211 53 24 14 9 7 
20 216 55 25 14 9 7 


| 
i 
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TABLE 3 (continued) 
C. a = 0.01, 8 = 0.10 (P = 0.90) 


m =10 #=15 =20 =25 =3.0 

2 121 32 15 10 7 6 

3 121 31 15 9 6 6 

4 120 31 15 9 6 5 
Ny, 5 120 31 14 9 6 5 
6-13 120 31 14 8 6 5 
14-15 120 30 14 8 6 5 
16-20 120 30 14 8 5 5 
2 121 32 15 10 7 6 

3 127 33 16 10 7 5 

4 132 34 16 10 7 5 

5 135 35 16 10 a 5 

6 138 35 16 10 rs 5 

* f 140 36 16 10 7 5 
(Ns)max 8 143 36 17 10 t 5 
9 144 37 17 10 7 5 
10 145 37 17 10 7 5 
12 148 38 17 10 7 5 
14 151 38 17 10 Yi 5 
16 152 39 18 10 7 5 
18 154 39 18 10 7 5 
20 155 39 18 10 Fs 5 
2 121 32 15 10 7 6 

3 131 34 16 10 re 5 

4 138 35 17 10 e 5 

5 142 36 17 10 7 5 

6 146 37 17 10 7 5 

7 149 38 17 10 rf 5 
(Nx) min 8 151 39 18 10 7 5 
9 153 39 1s 11 7 5 
10 155 40 18 11 5 
12 158 40 18 ll 7 5 
14 161 41 19 ll 7 5 
16 163 41 19 11 7 5 
18 165 42 19 11 vi 5 
20 167 42 19 11 7 5 
2 121 32 15 10 yf 6 

3 143 37 11 6 

4 156 40 19 ll 8 6 

5 166 43 20 12 8 6 

6 174 44 20 12 8 6 

7 180 46 21 12 8 6 
Nw 8 186 47 22 13 9 6 
9 191 49 22 13 9 7 
10 195 50 23 13 9 7 
12 203 51 23 14 9 7 
14 209 53 24 14 9 xj 
16 214 54 25 14 9 7 
18 219° 55 25 14 10 7 
20 224 56 26 15 10 7 


A 


TABLE 3 (continued) 
D.a@ = 0.01, 8 = 0.05 (P = 0.95) 


m =15 =20 =25 = 3.0 

2 145 38 18 11 8 6 

3 144 37 17 11 7 6 
4-7 144 37 17 10 7 5 
Nz 8 143 37 17 10 7 5 
9-12 143 36 17 10 7 5 
13-19 143 36 17 10 6 5 
20 143 36 16 10 6 5 
2 145 38 18 ll 8 6 

3 151 39 18 ll 8 6 

4 156 40 19 ll & 6 

5 160 41 19 11 8 6 

6 163 42 19 11 8 6 

7 166 42 19 11 8 6 
(Ns)max 8 168 43 19 1l 8 6 
9 170 43 20 ll 8 6 
10 171 43 20 12 8 6 
12 174 44 20 12 8 6 
14 177 45 20 12 8 6 
16 179 45 20 12 8 6 
18 181 46 21 12 8 6 
20 182 46 21 12 8 6 
2 145 38 18 11 8 6 

3 156 40 19 il 8 6 

4 163 42 19 12 8 6 

5 168 43 20 12 8 6 

6 172 44 20 12 8 6 

7 175 44 20 12 8 6 
(Nx)min 8 177 45 21 12 8 6 
9 180 46 21 12 8 6 
10 182 46 21 12 8 6 
12 185 47 21 12 8 6 
14 188 48 22 13 8 6 
16 191 48 22 13 8 6 
18 193 49 22 13 8 6 
20 195 49 22 13 9 6 
2 145 38 18 11 8 6 

3 168 43 20 12 8 7 

4 183 47 22 13 9 7 

5 193 49 23 13 9 7 
Nw 6 202 51 24 14 9 7 
7 209 53 24 14 10 7 

8 215 55 25 15 10 7 

9 220 56 25 15 10 7 
10 225 57 26 15 10 7 
12 233 59 27 15 10 7 
14 240 61 27 16 10 8 
16 245 62 28 16 ll 8 
18 250 63 29 16 11 8 
20 255 64 29 17 ll 8 
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9. Computation of the Tables 


Let the critical range MSD of a range test on p out of m ordered 
means with significance level ay be represented by 


MSD = Q(x ,p, m, »)sz , (18) 


where M may be J, X, S, or W and Q (ay , p, m, v) is the significant 
studentized range at the ay level for a test on p out of m ordered means 
with » degrees of freedom for s. The equivalent LSD test has sig- 
nificance level a, such that 


tar, = Qlaw , p, m, v)/V2. (19) 


It should be noted that (19) is a generalization of (8). While (8) holds 
only for Tukey’s studentized range test, (19) holds for that test and also 
for the multiple range tests. The part of Table 1 concerned with a, 
corresponding to ag = 0.05, 0.01 is computed directly from (9). The 
remainder of Table 1 is computed from (19). One knows Q(ay , p, m, v) 
and must find a, by interpolation from the probability integral of the 
t-distribution, which has been tabulated by Hartley and Pearson 
[1950]. The method of interpolation is linear harmonic v-wise and three- 
point Lagrangian ¢-wise. For v > 20,t > 4, the Hartley-Pearson tables 
cannot be used. With ¢ > 4, tables of the incomplete Beta-function 
edited by Karl Pearson [1934] were used for »y = 21 and for even values 
of vy up to and including 100. The probability that ¢ exceeds ¢, is given by 


P(t > = (1/2, v/2), (20) 


where 


= v/v + (21) 


In one case (v = 27, ¢ > 4), neither of the above methods is applicable, 
but there a method proposed by Fettis [1954] can be used. 


The combined Type II and Type III error rate, 84, , is given by the 
relation 


Bu = P{t < ,p, m,v) — 8’ VNI/V2}. (22) 


In computing Table 2, one knows ay , m, N, p and 4’, and hence also 
the value below which ¢ must lie. The corresponding probability is 
found by use of the Hartley and Pearson [1950] tables of the probability 
integral of the ¢-distribution or one of the other methods mentioned 
above. It would perhaps be better from a theoretical point of view to 
fix § rather than 6’, but then in computing tables it would be necessary 
to use the non-central ¢-distribution. Available tables of this distri- 
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bution, including those published recently by Resnikoff and Lieberman 
[1957], are such that this does not appear feasible. 

Table 3 is also computed from equation (22) by one of the above 
methods. Here ay , 84, , m, p and 6’ are known, and N is to be de- 
termined. Since only integral values of N are possible, the value of 
N found by solving equation (22) must be rounded to the next higher 
integer. 
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127 NOTE: SOME SIMPLE METHODS OF COMPUTING 
PARAMETERS IN THE ANALYSIS OF VARIANCE 


G. Mapow 
Stanford Research Institute, Menlo Park, California, U.S. A. 


1. Introduction 


The analysis of variance and F-distributions depend on two param- 
eters, the degrees of freedom, f, of the numerator and denominator 
mean squares. 

Tang’s distribution depends not only on the degrees of freedom 
but also on a parameter, A, that determines the power of the analysis 
of variance test. 

The purpose of this note is to present some methods of computing 
both f and A. 


2. Computing f. 


We begin by recalling that the sums of squares that occur in the 
analysis of variance are quadratic forms, and that the number of degrees 
of freedom of the sum of squares is the rank of the quadratic form. 

For example, upon expanding the square, we see that 


(y; 9) = = 9 
where c;; = 1 — 1/n and ¢;; = ¢;; = —1/n; and 7 is the arithmetic 
mean of y,; , --* , ¥,- The number of degrees of freedom of S’ is n —1, 
which is also the rank of q. 

Now the rank of a quadratic form does not depend on the statistical 
assumptions that may be made. Consequently, we shall assume, in 
order to compute f, that the underlying random variables are normally 
and independently distributed with 0 mean and constant variance o’. 

We shall also assume that q/o” is known to have the x’-distribution. 
Various methods of obtaining analysis of variance ratios guarantee 
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this; for example, the likelihood ratio test of the general linear hypoth- 
esis. 
Furthermore, if g/c” has the x’-distribution then 


where f is the number of degrees of freedom or rank of g. Thus one 


’ way to compute the degrees of freedom is to calculate the expected 
value of g. 


Rule 1. Compute the expected value of g under the hypotheses stated 
above. Then - 


E(q) = fo” 


where f is the number of degrees of freedom. 

For example, E >“*., (y; — 9)? = no? — = (n —1)o’ and hence 
f=n-1. 

But rule 1 can be simplified. Since 


» 


it follows that if the uncorrelated random variables y, , --+ , y, have 
0 means and constant variance o”, then 


E@ = o 
and hence f = , (the trace of q). 
Rule 2. Determine c,; by putting y; = land y; = 0,j zing. (Take 
advantage of symmetry in doing so.) Thus, in 
1\’ 1 —1 
— 9) = (1 +a - )a=" 


,n. 


For rule 2 one need not evaluate any expected values. But for 
many quadratic forms the work of computing f can be further reduced 
by using two simple theorems on expected values. 

(1) If y: , --* , Ym are uncorrelated, and have common mean and 
variance o;, , then 


Vy 
| 
3 | 
n 
n 
a 
| 


QUERIES AND NOTES 539 
(2) Under the same assumptions as (1) we have 


It is easy to prove either (1) or (2); the other is an immediate conse- 
quence. 

Now let us illustrate the method. Let a dot, (-), indicate the mean 
for the subscript and let 


Since the expected value of a sum of random variables is the sum of 
their expected values, we need to evaluate 


Ay = by(I)since 5-2. = 


(ti; — 2.) 


rs s = 
Hence 
E(q) = — — 
and 


The approach embodied in the above example can be used on many 
sums of squares especially including those occurring in factorial designs. 


Rule 3. If g is a quadratic form which can be written 

¢= (y; — 9)” 
where y; , -** , y, are uncorrelated, have common means, and common 
variance, , then 

E@ = ¢ — 


and, if y; = &; — &. , where &;, is the mean of s uncorrelated random 

variables £;, , £2 , -** , &:2 , Which have 0 means and common variance 
2 

o then 


= —31)(s — 


Now 
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and, if the process can be continued on £;; , 


E(q) = (r — Is — Nori 


and so on. 


3. Computing X. It is easy to verify that, if E(y;) = a; , then 


E(q) = o°f + C5 


and it has earlier [1] been shown that 


A= C,;4,0; . 
i,7=1 
Thus, A is the same quadratic form in a, , --+ , a, that q is in 
Rule 4. If E(y;) = a; instead of 0, \ is determined by replacing y; by 
a; in S’ where S’ is the sum of squares that becomes g when the squares 
are expanded. 
Suppose, for example, that E(y;) = a. Then if 
S? = 9)’, 
we have 
i=l 
If 


= > — 2%. 25+ 
and E(x;;) = a; + b; , then 
FV OF =0. 


i=-1 j=1 
But if E(x,;) = a;; , then 
A= (a, a..)°. 
i-1 j=1 
REFERENCE 


[1] W. G. Madow [1940]. The distribution of quadratic forms in non-central normal 
random variables. Ann. Math. Stat., 11: 100-104. 
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128 NOTE: A ROUTINE FOR COMPUTING THE DEGREES OF 
FREEDOM IN ANALYSIS OF VARIANCE 


Berti, Mat&rn 
Forest Research Institute, Stockholm 51, Sweden 


This note describes a simple method of automatically computing 
the degrees of freedom along with the sums of squares in an analysis 
of variance. The procedure seems to be useful in routine compu- 
tational work. It may be known to many statisticians, but seems 
not to be described in any textbook. 

A few, rather obvious, remarks will suffice to justify the procedure. 
We consider a number of observations y; , ¥2 , --: . We assume that 
we can write y; = m; + e; , where the “errors” {e;} are normally and 
independently distributed around zero with constant variance V. 
By 2, , 22, °** , we denote linear forms in the y’s with error variance 
V. Thus every z is of the form >-a,y; with }ca? = 1. In standard 
applications we start by computing certain sums P, , P, , --+ , where 
every P is a sum of squares of some z’s. We then form sums of squares 
attributed to different sources of variation by calculating expressions 
of the type Q = )-9;P; , where the g’s are constants (positive, negative, 
or zero). In all regular cases Q/V has a chi-square distribution under 
the additional hypothesis that the source of variation in question does 
not affect the m’s. The number of degrees of freedom associated with 
Q then equals the expectation of Q/V. From the definitions of {P;} 
and {z,}, it is seen that this expectation is gq = 2 g;p; , where p; is 


TABLE 1 
EXPERIMENTAL OBSERVATIONS (FICTITIOUS) 


Treatment Block I Block II Sum 
AoBo 2.7 2.9 5.6 
AoBi 2.0 3.6 5.6 
AiBo 3.1 3.3 6.4 
A,B, 3.8 2:7 6.5 
A2Bo 2.5 4.3 6.8 
AB, 4.3 4.0 8.3 
Sum 18.4 20.8 39.2 


Treatment totals: 
Ae:11.2 A, :129 Az:15.1 Bo:188 B, : 20.4 


| 
| 
| 


542 


BIOMETRICS, DECEMBER 1957 


the number of squared z’s in P;. Thus, Q and q are obtained by the 
same linear operation carried out on the P’s and p’s, respectively. 


Therefore, Q and g can conveniently be computed parallelly. 


TABLE 2 
CaLcuLATION OF {P;} AND {p;} 
Number 
Grouping Sum of squares of 
squares 
1) Individual 
2 fe 2 
2 2 2 
(3) A 11.2° + + 15.1 = 129.9650 |p = 3 
2 2 
@ B = 128.2667 2 
2 2 
(5) Blocks p,= BEES = 128.5333 |m= 2 
(6) Total P, = 32 = 128.0533 |m= 1 
TABLE 3 
ANALYSIS OF VARIANCE 
Computation, from Source of 8.8. . df. m.8. 
rows of table 2 variation 
(5) — (6) Blocks 0.4800 1 0.4800 
(3) — (6) A 1.9117 2 0.9558 
(4) — (6) B 0.2134 1 0.2134 
(2) — (8) — (4) + ©) AB 0.3516 2 0.1758 
(1) — (2) — (5) + ©) Error 3.1100 5 0.6220 
(1) — (6) Total 6.0667 11 


Example (interaction AB): 
Q = P, — P; — Py + Pe = 0.3516 


. 
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To give an example, we consider a 3 X 2 factorial experiment in 2 
blocks. The factor A has the three levels A, , A; , and A,. The factor 
B has the two levels B, , and B,. Table 1 shows the data of the ex- 
periment. The computation of the sums P is given, together with 
the numbers p, in Table 2. Finally, the sums of squares, and degrees 
of freedom, as derived by linear operations on P and p, respectively, 
are shown in Table 3. The example is only intended as an illustration 
of the computational technique. The procedure is, of course, more 
useful in complicated cases. 


CORRECTIONS TO PAPERS 


CHIN LONG CHIANG. An Application of Stochastic Processes to 
Experimental Studies on Flour Beetles. Biometrics, 13(1), 79-97, 1957. 


On page 88, in line 20, 6 = (6, , 6., ---, 6,) should be read as 6 = 
(6, , 62, «++, 6,); in line 29, ¢;(@) and in line 30, ¢,(6) should both be 
read as ¢;(8). On page 90, in equation (48), d,, should be read as d,, . 


F. J. ANSCOMBE. Fixed-Sample-Size Analysis of Sequential Ob- 
servations. Biometrics, 10(1), 89-100, 1954. 


The two consecutive sentences half-way down p. 100, in which the 
names of Fisher and Wald are mentioned, should be deleted. It is 
true that in some instances a distribution obtained by Fisher’s fiducial 
argument, or a decision function obtained by Wald’s minimax rule, 
could have been derived from Bayes’ theorem. But I had not ap- 
preciated, when writing this paper, that the “prior” distribution that 
must be inserted in Bayes’ theorem in order to obtain Fisher’s or Wald’s 
result, may itself depend on the sampling rule. The matter is dis- 
cussed in a paper, ‘‘Dependence on the fiducial argument on the sampling 
rule,” to appear in Biometrika, 44 (1957). 


H. R. THOMPSON. Extensions to Missing Plot Techniques. Bio- 
metrics, 12(3), 241-244, 1956. 


I am grateful to George Hirschhorn for pointing out an arithmetical 
error in one element of the matrix on page 244 of my paper referenced 
above. The leading element should be 43470 and not 44345. All 
entries in the matrix were obtained from the solution for p = 3 and 
divided by 4 to reduce the number of digits. The missing values in 
Table I are correct, however, as they were calculated using the original 
correct matrix. 


“4 


APPRECIATION 


When Dr. J. W. Hopkins assumed responsibility for Biometrics, in 
January 1956, the Society looked forward with anticipation to a long 
and successful term with the journal under his editorship. 

Members will learn with regret that John Hopkins later had the 
misfortune to become ill, and was forced to relinquish office in March 
of this year, after having fought a losing battle for many months with 
recurrences of the illness. He felt very keenly his inability to carry on 
with the job he had undertaken for the Society and the journal, and 
it was characteristic of the man that he did not wish to retain a re- 
sponsibility which he could not effectively discharge. 

We are greatly indebted to John Hopkins for the sterling service he 
has given, not only for his contribution as Editor, when the major part 
of his work was carried out under extreme difficulties, but also earlier 
as Chairman of a former Finance Committee, as a Councillor, and as 
an Associate Editor. I take this opportunity of extending to him, on 
behalf of the Society’s membership, our warmest thanks, and best 
wishes for a rapid recovery. 

Thanks are also due to Mrs. Florence H. Suddon, for the unselfish 
and competent assistance she gave the Editor, especially during the 
periods of his enforced absences. 

We gratefully acknowledge, too, the very generous contribution 
made by the National Research Council of Canada, in providing office 
accommodation and the numerous facilities necessary for the efficient 
conduct of an Editor’s duties. 


E. A. Cornish 
President 
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ABSTRACTS 


Papers presented at the meeting of the Biometric Society (W.N.A.R.) with the American 
Institute of Biological Sciences at Stanford University, Stanford, California, August 
27 and 28, 1957. 


ROSEDITH SITGREAVES (Teachers College, Columbia Uni- 
versity). Discriminating Analysis in a Non-Classical Case. 


In the classical theory of discriminatory analysis, a linear function 
(i.e., Fisher’s discriminant function) of a set of measurements for a 
given individual is used in classifying the individual as belonging to one 
of two populations. This function has been shown to possess some 
optimal properties if the measurements are continuous, and are normally 
distributed in each population with the same covariance matrix. Some 
recent work of Bahadur has been concerned with the problem of classi- 
fication when the measured variables are all dichotomous, and he has 
developed some approximations to the optimal classification statistic 
for this case. In the present paper, a probability model is constructed 
for an experimental situation in psychology involving ‘aggressive’ 
and “non-aggressive” children, and Bahadur’s results are used to de- 
termine an appropriate classification index. This index is evaluated 
in terms of the probabilities of misclassification. Approximations to 
the probability distributions and the resulting classification index are 
also considered. 


ALBERT C. WALKER (International Minerals and Chemical 
451 Corporation, Woodland, California). Canonical Vectors among 
Sugar Beet Varieties. 


A canonical analysis of the results of an experiment on thirteen 
varieties of beets is presented. Nine of these had been chosen because 
of high sucrose content. Highly significant differences were present 
both in three commercial characteristics, root weight, sucrose, and 
glutamate content, and in the three auxiliary measurements of top 
weight, sodium, and amino acids. Although there was considerable 
correlation between some of these variables, no reduction in the number 
of dimensions necessary to depict the varietal differences could be 
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obtained. The canonical vectors obtained are interpreted as the best 
available measures of the underlying genetic differences. The order of 
importance, for this experiment, was found to be (1) vigor, or all-over 
utilization of energy, (2) distribution of energy between growth and 
storage in the form of sucrose or glutamate, (3) nitrogen foraging 
ability, (4) distribution of growth between root and top, and (5) a 
curious positive correlation between sodium and sucrose, in two varieties 
known to be carrying a gene causing albinism. 


WILLIAM F. ROYCE (U.S. Fish and Wildlife Service, Juneau). 


= Statistical Comparisons of Morphological Data. 


The methods of statistical comparison of morphological data are 
examined with special regard to techniques suitable for racial studies 
of tunas. Comparison of samples by means of a test of significance is 
discarded because it leads to a trivial conclusion. In its place is sug- 
gested and defined a measure of overlap of frequency distributions. 

The computation of overlap for single counted characters is extended 
to single measured characters through the use of regression analysis 
and to multiple characters either counted or measured or both. 

The general concept of overlap is related to the proportion of one 
sample which might belong to another specified sample of equal size. 
This is considered to be a maximum for any intermingling which might 
have occurred. It is pointed out that such a maximum is useful when 
considered in conjunction with tagging returns. 


HARLEY B. MESSINGER (University of California, Berkeley). 


- Clinical Trials in Breast Cancer, Statistical Problems. 


The commonest measures of treatment effectiveness in cancer are 
based on survival time after diagnosis or treatment. In the case of 
breast cancer, there is a strong dependence of survival time upon the 
age of the patient. Actuarial analysis of the excellent data of the 
Connecticut Cancer Registry reveals that this age effect can be removed 
by dividing the survival percentage at each point in time after diagnosis 
or treatment by the percentage that would be expected to survive to 
that point in time among a group of people of the same age, sex, and 
race from the general population. 

This fact can be utilized, for example, in the Bross sequential method 
of treatment comparison to avoid the need for matching pairs of patients 
by age (see Billewicz’ article in Biometrics 12:283, 1956). In each un- 
matched pair, one would simply observe which patient lived for the 
greater fraction of her expected survival time. 
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The common fractions obtained with above correction from the 
various age groups by weighted averaging increase with the log of time 
like a cumulative normal distribution function. The lognormal distri- 
bution has been fitted before to uncorrected survival curves, but this 
technique has worked well only in cases where the mortality from cancer 
was so high that normal mortality was negligible, e.g. lung cancer. 

Further, there are indications that the lognormal distribution de- 
scribes well the distribution of intervals from treatment until the 
appearance of metastases as well as certain other similar classes of 
intervals which might have utility as measures of treatment effective- 
ness. Thus, it may be possible to combine other measures of effective- 
ness with survival time by the use of multivariate normal techniques 
to permit much earlier decisions to be made regarding the relative 
merit of two or more treatments. 


MOHAMED 8. AHMED (University of California, Berkeley). 
454 Test that the Probability of Disease is Independent of the Total 
Length of Time Previously Suffered from the Disease. 


Consider an individual who has suffered from a disease for a total 
length of time r. Let P(t; 7) be the probability that, if he contracts 
the disease again, he remains sick for at least time ¢. Assume that 
P'(t; vr) leno = —Q(r). The distribution of the total time 7’ spent in 
sickness in n successive returns is obtained. Let g(r) = a + 2br, with 
a and b non-negative and a known. We test the hypothesis of inde- 
pendence, b = 0. The test criterion is to reject the hypothesis of 
independence whenever 7’ < k(n, a)/a, where k(n, a) is adjusted in 
accordance with the desired level of significance a. The test is uni- 
formly most powerful for the restricted class of alternatives b > a’k 
(n, w). A formula for the power function is given. Similar results 
can be obtained for other situations for example, g(r) = a + 2br + 3cr’. 
The problem is more complicated when more than one individual is 
considered. A method for the estimation of parameters is given and 
asymptotically similar tests are worked out for this case. Other appli- 
cations include (2) accident-free flying experience of pilots and (i) 
tunnelling of flour beetles. 


W. F. TAYLOR (School of Aviation Medicine, Randolph Field, 
455 Texas). Some Monte Carlo Methods Applied to an Epidemic of 
Acute Respiratory Disease. 


A probabilistic model and its deterministic counterpart are developed 
to describe the course of acute respiratory disease among recruits at 
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the former Air Force reception center at Sampson Air Force Base, New 
York. Four parameters are estimated and the model is evaluated 
by comparing observed epidemics which occurred among 117 groups of 
recruits with 415 analogous Monte Carlo epidemics resulting from using 
the model. Computations were performed with the aid of a CRC 102A 
electronic computer. 

The following assumptions were made in building the model: 

1. Sources of infection exist both from within and without the 
group. 

2. The risk of a susceptible man becoming infected from within the 
group in a small increment of time, At, is proportional to the number of 
infectious men, J, within the group. Risk from within = XJ. 

3. The risk of infection from without the group is constant. Risk 
from without = uz. 

4. Once recovered from infection, a man has partial immunity, and 
the constants \ and y in 2 and 3 above are both altered by multiplying 
them by the same factor. Risk from within = adJ. Risk from with- 
out = au. 

5. There exists an unknown proportion, 8, of men who become 
infected but are not detected. These men remain in the group and 
provide the pool of infection from within. 


J. GURLAND AND §&. K. KATTI (Iowa State College, Ames, 


456 Iowa). Generalized Families of Contagious Distributions. 


The notion of compound and generalized distribution is utilized to 
obtain the probability generating functions of certain families of con- 
tagious distributions. These families contain the family of Beall and 
Rescia (Biometrics, 1953) as a special case. Properties of the confluent 
hypergeometric function are applied to obtain formulae for computing 
probabilities. Some limiting cases of the families of distributions are 
also considered when certain of the parameters are allowed to become 
infinite. In particular, it is shown that the Neyman Type A and the 
generalized Polya-Aeppli distributions emerge as the limiting form in 
some cases, Different methods of estimation, including the method of 
moments and estimation by frequencies, are also considered in a pre- 
liminary investigation of some data. 


457 DOUGLAS G. CHAPMAN (University of Washington, Seattle, 
Washington). Problems of Estimation of Wildlife Mortality Rates. 


The estimation of mortality rates of wild animals in general has been 
based on one of two methods—recoveries of tagged or marked members 
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of the population and sampling of the population to determine its age 
structure at one or more time periods. The sources of bias in these 
methods are discussed. Several different formulae are reviewed and a 
new formula given which may be useful for those fish and animals 
where the largest cause of mortality is predation by man. The effect 
of the various sources of bias on the formulae is noted. Consideration 
is also given to the determination of estimates of the variances of the 
estimates; it will be usually preferable to find internal rather than 
external estimates of such variances. 


B. O. BERGH (University of California, Riverside, California.) 
458 Dominance Estimates from Tomato Populations Segregating for 
Locule Number. 


Tests of scale adequacy indicated the desirability of a logarithmic 
transformation of the data. Despite some evidence that this trans- 
formation was not strong enough, it was statistically acceptable ac- 
cording to the tests used. 

Estimates of the average absolute dominance of the segregating 
genes were made by two methods. That of Fisher, Immer, and Tedin 
[1932], amplified by Mather [1949], gave highly variable and statis- 
tically unreliable values. However, the results obtained indicate that 
the method developed by Comstock and Robinson [1952] was an 
efficient way to estimate genic dominance in these populations. 

Two major loci were found to be largely responsible for locule 
number differences in these populations. It was possible to assign 
genotypic values to each of the nine major locule number genotypes. 
From these values, a theoretical mean absolute dominance estimate 
could be calculated, based on the two major genes only. This estimate, 
0.66, was not markedly different from the 0.60 estimate obtained for 
all segregating genes by the method of Comstock and Robinson. Sim- 
ilarly, the potence estimate from the two major genes, 0.42, agrees 
closely with the estimate from the non-segregating generations, 0.43. 


W. A. BECKER (State College of Washington, Western Wash- 
459 ington Experiment Station, Puyallup, Washington). Computing 
Mortality in Experiments with Irrelevant Deaths. 


When computing mortality due to a specific cause, a, the experi- 
mentor has to make an adjustment for individuals that died of all other 
causes, b. Let us assume that a and b are independent and no individuals 
die of joint causes a and b. The mortality due to a can be computed by 
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calculating the proportion of individuals that did not die of a during 
each period of time, x, say the 24 hour periods in a yearly experiment. 
These proportions are multiplied and the product subtracted from one 
to give the desired mortality figure. A certain sample size has to be 
maintained during the experiment. 


HANS ABPLANALP (University of California, Davis, Califor- 
nia). Effective Population Size Under Selection. 


Effects of artificial selection procedures on the relation between 
actual and effective population size are studied. The analysis is based 
on data from two generations of eleven closed chicken breeding flocks. 
Estimates of effective population size are based on methods derived by 
Crow and Morton (1954, Evolution 9: 202-214), contrasting expected 
sampling variance in gene frequency of an ideal population of N, in- 
dividuals with an estimate of the sampling variance derived from the 
data. 

Four selection procedures are applied to the same breeding records, 
contrasting natural selection to 10 months of age, mass selection for 
a trait with high heritability (body weight), and two selection pro- 
cedures for a trait with low heritability (egg production) where family 
averages are used to supplement information from individual pro- 
duction records. 


R. W. ALLARD (University of California, Davis, California). 
461 Effect of Genotypic—Environmental Interactions on the Predic- 
tion of Genetic Advance under Selection. 


Earliest and latest plants were selected in segregating generations 
of the wheat hybrid Baart X Ramona to determine the progress possible 
under different intensities of selection. In a parallel series of experi- 
ments, estimates were made of various genetic and environmental 
components of variance. Predictions of genetic advance under selection 
based on these estimates often did not correspond closely to the actual 
genetic advance. Although conspicuous interactions between genotype 
and environment occurred in these materials, these interactions did not 
appear to be the main cause of the failure of the predictions. The 
disturbing effect of a major gene on the selection differential, and non- 
allelic gene interactions, unaccounted for in the prediction equation, 
were advanced as more probable explanations for the discrepancy. 
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EK. NOVITSKI AND E. DEMPSTER (Oak Ridge National 
Laboratory, Oak Ridge, Tennessee, and Department of Genetics, 

462 University of California, Berkeley, California). Analysis by 
Automatic Computer of Data from Laboratory Populations of 
Drosophila Melanogaster. 


In some cases heterosis (hybrid vigor) is inferred because a popula- 
tion appears to contain more heterozygotes than can be accounted for 
by ordinary genetic theory. A mathematical model based on a maxi- 
mum likelihood formulation has been set up that allows an estimate to 
be made of the degree to which irregularities in genetic data required 
the assumption of heterosis. The analysis based on such a model is 
not practicable by ordinary methods, and for this reason an automatic 
computer (the Oracle at Oak Ridge National Laboratory) was used. 
The output of the computer showed that the assumption that heterosis 
is responsible for the increased frequency of the heterozygotes is un- 
warranted, and that a reduced viability of homozygous classes will 
equally well explain such data. This result is of some interest because 
it shows that one of the most commonly applied tests for heterosis, a 
statistical test for an excess of heterozygotes in a population, is actually 
ambiguous. 


L. H. KOOPMANS AND DOROTHY C. LOWRY (Uni- 
463 versity of California, Berkeley, California). Estimation of Mean 
and Variance of a Distribution of Binomial Probabilities. 


The following binomial probability situation is frequently en- 
countered. Each individual in a population P has associated with it 
a probability p which reflects the individual’s potential for inducing 
a characteristic C in each of a number of items it produces. Suppose 
that the distribution function of these probabilities is F(p) with mean 
and variance z and o,. A random sample of k individuals is selected 
from P. For each z (i = 1, 2, --+ , k), n; items from the production of 
the individual are chosen at random and are tested for the presence of 
the characteristic C. On the basis of the resulting numbers of items 
possessing C' we have derived unbiased estimates of rand o,. A natural 
estimate of a is 


k 
where n = ><n; and X;, is the observed number of items possessing 
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the characteristic C from the i“ individual. An unbiased estimate of 
2 
6,,18 


n—1 
- 


Exact expressions were derived for the variances of # and 6} . 


ALBERT C. WALKER (International Minerals and Chemical 
464 Corporation, Woodland, California). Balanced Incomplete Diallel 
Crosses. 


Progeny testing of genotypic values can be extended to a wider 
range of parents by use of an incomplete diallel. Comparison may be 
equalized and calculation simplified by application of the balanced 
incomplete block technique. A table of the number of males, number 
of females, and number of crosses per parent (complete to 200 pro- 
genies), necessary for balanced comparison, is presented. When the 
number of males is different from the number of females, only the 
smaller set gives complete balance. Cases where the larger set affords 
partial balance with only two associate classes are indicated. 

Calculations of best estimates of parental values and their variances 
are detailed, with two examples involving glutamate content of pro- 
genies from sugar beet diallel crosses. 
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THE BIOMETRIC SOCIETY 


REGIONAL 
E.N.A.R. 


The Tenth Annual Meeting was held in Atlantic City, September 
10-13, 1957. This was also a joint meeting with the American Statis- 
tical Association and the Institute of Mathematical Statistics. 

Some of the session subjects and speakers were: 


A. Methodology and Application of Response Surface Exploration: 
J.S. Hunter, R. M. DeBaun; 

B. Forecasting Crop Yields: G. M. Kuznets, Y. Mundlak, H. F. 
Huddleston, R. Parr, L. Calvin; 

C. Stochastic Processes in Biology: J. G. Huffman, E. Fix, E. K. 
Harris; 

D. Genetic Statistics: W. R. Harvey, D.S. Robson, O. Kempthorne; 

E. Fractional Replication: R. L. Anderson, J. D. Hromi, M. 
Carroll, C. Daniel; 

F. Psychometric Statistics: H. Gulliksen, F. M. Lord; 

G. Tolerance Limits: J. E. Walsh, E. A. Thomas, A. Lieberman; 

H. Clinical and Field Trials in Public Health and Medicine: F. M. 
Hemphill, 8. W. Greenhouse, J. Williams, P. Armitage; 

I. Diet and Heart Disease: F. Moore, M. Robins, E. A. Lew. 


Attendance at several sessions was 150 to 200. The annual business 
meeting was also held in Atlantic City. The minutes follow this report. 


E.N.A.R. Minutes of Tenth Annual Business Meeting 


The Tenth Annual Business Meeting of the Eastern North American 
Region of the Biometric Society was called to order at 2:00 p.m., Sep- 
tember 12, in the Ambassador Hotel, Atlantic City. The presiding 
officer was Boyd Harshbarger, the Regional President. There were 
forty-four persons in attendance. The minutes of the Ninth Annual 
Business Meeting were read and approved. 

S. M. Free, Chairman of the Regional Advisory Board, reported on 
a recommendation by that body that the ENAR By-laws be revised 
to make provision for a Regional President-elect. The reasons advanced 
by the RAB for the possible revisions were: 
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1. It would improve the continuity of the office of regional president. 
At present, a president is usually returned to office for a second term. 
At the end of that term a completely uninitiated person takes over the 
position. 

2. The presidential honor (and duties) would be given to more 
people. 

3. The president-elect could provide a needed over-all program 
chairman for ENAR’s many meetings. 

The amendments proposed by the RAB are as follows: 


1. By-law 3. Regional Committee 
Change the first sentence to read: 
“The Regional Committee shall consist of the Regional Presi- 
dent who shall serve as its chairman, the Regional President- 
elect, the Regional Secretary-Treasurer, and six ordinary members 
who shall serve three-year terms.” 
2. By-law 7. Program Committees 
Delete present By-law and substitute the following: 
“The Regional President shall appoint program committees to 
arrange meetings in connection with those of specific organiza- 
tions or groups of organizations. The Regional President-elect 
shall act as program coordinator and shall establish appropriate 
liason with program committees of organizations with whom the 
Region meets. He shall report the tentative program plans for 
the coming year at the Annual Meeting.” 
3. By-law 8. Election 
Change the first sentence to read: 
“On or before August 1 each year the Regional Advisory Board 
shall submit nominees for Regional President-elect, Regional 
Secretary-Treasurer, and two nominees for each vacancy among 
ordinary members of the Regional Committee to the Regional 
Secretary-Treasurer.”’ 
4, By-law 8. Election 
Add the following sentence: 
“The Regional President-elect shall serve for one year and at the 
close of this term automatically become Regional President, 
subject to confirmation by the Council of the Society.” 


These amendments were voted on by those attending the meeting. 
The amendments passed without a dissenting vote. 

It was also moved that the ENAR secretary be permitted to cir- 
culate ballots by mail for the election of a Regional President-elect for 
1958. The RAB had already nominated for this position and it was 
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suggested that the ENAR secretary should send these ballots to the 
membership as soon as practicable after the present meeting. 

A short report on the plans for each of the following meetings was 
given: 


Gatlinburg meeting on April 10, 11, 12, 1958. 

Indianapolis meeting after Christmas, 1957. 

The Fourth International Biometric Conference and International 
Symposium on Biometrical Genetics to be held in Ottawa August 
28 to September 2, 1958. 


It was moved that the Chicago meeting with the ASA December 
27-30, 1958, be designated the Eleventh Annual Meeting of ENAR. 

Ralph Bradley was introduced as the new editor of Biometrics. He 
said that: the June issue would be out soon, the September issue on the 
Analysis of Covariance will be late, and the December issue will be out 
on time. 

C. I. Bliss offered the following resolution to be suitably communi- 
cated to John W. Hopkins: Resolved that the Eastern North American 
Region of The Biometric Society express its sincere appreciation for the 
interest, enthusiasm, and leadership contributed by John W. Hopkins 
as editor of Biometrics and in his service to the Region in that capacity. 

The resolution was unanimously subscribed to by the meeting. 

R. G. Hoffman initiated a discussion concerning the problem of 
communication and interchange with the Medical Sciences. It was 
suggested that we should continue meeting with the FASEB and 
initiate further meetings with the medical clinical: groups. It was 
suggested that perhaps a committee should be set up to implement 
such meetings. After considerable discussion of initiating further 
meetings of ENAR it was moved, seconded and passed that the Re- 
gional President should appoint a committee to explore and arrange 
with the advice and agreement of the Regional President-elect the 
feasibility of a joint meeting with the American Medical Association, 
the American College of Surgeons, or a similar group at the conlient 
effective and satisfactory date. 

The meeting was adjourned at 3:05 p.m. 


Respectfully submitted, 
Arthur M. Dutton 


Région Belge 


Le 25 octobre 1957, la Société Adolphe Quetelet a eu le plaisir et 
Vhonneur d’accueillir 4 sa tribune le Professeur Jerzy NEYMAN de 
l'Université de Californie de passage en Belgique ainsi que le Profes- 
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seur Robert CONSAEL de |’Université de Bruxelles. Le séance était 
organisée 4 |’Institut Agronomique de |’Etat 4 Gembloux et réunissait 
une nombreuse assistance de professeurs et d’étudiants ainsi que des 
membres de la Société, en présence de Monsieur HESPEL, Recteur de 
l'Institut. La conférence du Professeur NEYMAN portait sur un 
travail original qu’il a développé en collaboration de Mademoiselle 
SCOTT sur la Théorie probabiliste du développement des populations 
biologiques congues comme conglomérations de familles. Ce brillant 
exposé donné en frangais a été suivi d’une discussion animée dirigée par 
le Président de la Société Adolphe Quetelet, Monsieur R. LAURENT. 
Le Professeur CONSAEL a ensuite parlé du Processus de Poisson & une 
ou deux variables aléatoires. Lors de la discussion qui suivit, le Pro- 
fesseur NEYMAN mit en lumiére la parenté existant entre cet exposé 
et le sien, insistant sur la portée pratique de I’un et de |’autre. 


OFFICERS 
E.N.A.R. 


The following officers and committee members were elected by mail 
ballot by the members of the Eastern North American Region:— 
Regional President for 1958—Boyd Harshbarger; President-elect for 
1958—Jerome Cornfield; Secretary—T. W. Horner; Regional Com- 
mittee for 1958-1960—D. B. Duncan and H. O. Hartley. The Presi- 
dent-elect, who will become President in 1959, is a new officer. One of 
the duties of this officer will be to handle the meetings and programs 
of the E.N.A.R. 


W.N.A.R. 


The Regional officers for 1958 are as folows:—President—J. L. 
Hodges, Jr.; Secretary-Treasurer—Miss M. M. Sandomire; Regional 
Committee—1956-1958, Bernice Brown, L. Calvin; 1957-1959, W. J. 
Dixon, J. C. R. Li; 1958-1960, D. G. Chapman and D. Wohlschlag. 
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NEWS AND ANNOUNCEMENTS 


Members are invited to transmit to their National or Regional Secretary 
(if members at large, to the General Secretary) news of appointments, 
distinctions, or retirements and announcements of professional interest. 


Marcus Kjelsberg, University of Minnesota, who spent the last year 
as Special Research Fellow in the Epidemiological Research Unit of 
the Cardiovascular Department of Michael Reese Hospital in Chicago, 
has joined the staff of the Department of Tropical Medicine and Public 
Health, School of Medicine, Tulane University as Instructor in Bio- 
statistics. 


MEETING OF THE BIOMETRIC SOCIETY, E.N.A.R. 


The Biometric Society (E.N.A.R.) will meet jointly with the In- 
stitute of Mathematical Statistics at Gatlinburg, Tennessee, Thursday, 
Friday, and Saturday, April 10, 11, and 12, 1958. Titles and abstracts, 
the latter in duplicate in the form published in Biometrics, for con- 
tributed papers for the Biometric Society should be sent to Donald A. 
Gardiner, Oak Ridge National Laboratory, Post Office Box Y, Oak. 
Ridge, Tennessee, not later than March 10, 1958. 


SYMPOSIUM ON BIOMETRICAL GENETICS AND THE 
FOURTH INTERNATIONAL BIOMETRICAL CONGRESS 


The Symposium on Biometrical Genetics and the Fourth Inter- 
national Biometrical Congress will meet in Ottawa, Canada, from 
August 28 to September 2, 1958. May 1 is the deadline for pre-regi- 
stration and for receiving papers and manuscripts. Queries regarding 
programmes and contributed papers are to be sent to Mr. M. J. R. 
Healy, Secretary, The Biometrics Society, Rothamsted Experimental 
Station, Harpenden Herts., England; queries concerning local ar- 
rangements should be sent to Mr. G. B. Oakland, Statistical Laboratory, 
Science Service Building, Central Experimental Farm, Ottawa, Canada. 


NATIONAL SCIENCE FOUNDATION TRAVEL AID 


The National Science Foundation will award individual grants to 
defray partial travel expenses for a limited number of American scientists 
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participating in the Special Session of the International Statistical 
Institute to be held in Brussels, Belgium, September 1-6, 1958. 

Application blanks may be obtained from the National Science 
Foundation, Washington 25, D.C. Completed Application Forms must 
be submitted by March 1, 1958. 


NEW COOPERATIVE PROGRAM IN BIOSTATISTICS 


The Virginia Polytechnic Institute, Blacksburg, Virginia, and the 
Medical College of Virginia, Richmond, Virginia, have initiated a 
cooperative graduate training program in biostatistics through a grant 
from the National Institutes of Health. This program will be supported 
by the combined facilities of the two institutions and is designed to 
meet in part the critical demand for statisticians in the medical and 
public health professions. It is proposed to train both M.S. and Ph.D. 
candidates in statistics as well as to give post doctorate training in the 
field of statistics to holders of Ph.D. degrees in related fields. 

Most of the course work will be given at the Virginia Polytechnic 
Institute, but certain specific courses relating to medical and public 
health statistics will be taken by the candidates at the Medical College 
of Virginia in Richmond. It is expected that in general a doctoral 
candidate will spend one year of his graduate study at the Medical 
College of Virginia. 

To carry on this program, a staff of nine in the Department of 
Statistics at the Virginia Polytechnic Institute is being increased by one 
professor specifically trained in biostatistics, and the Medical College 
is adding a professor of biostatistics who will supervise the work in 
statistics at that division. The number of fellowships for candidates 
for graduate degrees now existing at the Virginia Polytechnic Institute 
is being increased in number as well as in amount to take care of this 
new field of study in biostatistics. In addition to these fellowships 
post doctorate fellowships are available under this new program for 
students in biology, medicine, and other related fields who wish to 
study statistics. 
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Graphics, 185, 407, and see computation 
Greenhouse experiments, 235 
Group, Abelian, 411 
Group divisibility, 2 
Growth, 239, and see allometry, biom- 
etry, logistic curve, populations 
Harmonic analysis, 241 
Hematology, 485 
Heredity, see genetics 
Heritability, 111, 443, 550, and see 
genetic correlation 
Horticulture, 127, 355, and _ see 
agronomy, greenhouse experiments 
Hypergeometric, 498 
Hypothesis, see model, tests 
matrix, 421 
null, 269, 323, 346, 459, 502 
Identifiability, 93 
Immunology, 179 
Incomplete blocks, 1, 320, 422, and see 
lattices, linked blocks, randomized 
blocks 
balanced, 16, 239, 320, 422, 552 
convariance analysis, 280, 309 
group-divisible, 197, 422 
partially balanced, 2, 17, 197, 215, 
244, 320 
row-balanced, see incomplete Latin 
square 
Incomplete experiments, see missing 
values 
Industrial research, 222, 391, 401, 411, 
and see physical research 
Information, 178, 285 and see analysis 
of variance, design of experiments, 
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estimation, 
precision 
interblock, 1, 239, 268, 315 
intervariety, 412 
Inner product, 456 
Instrumental variate, 290, 302 
Interaction, 341, 455, 498, and see analy- 
sis of variance, model 
as error term, 140 
assumed zero, 333 
interpretation of, 498, 550 
of treatment and location, 498 
Interference, 99 
Interpolation, 535 
Irrigation, 506 
Iteration, 29, 54, 399, and see computa- 
tion, least squares, maximum likeli- 
hood 
Judging, see organolepsis 
Kronecker product, 19 
Latent root, 185, and see matrix algebra 
Latin rectangle, see incomplete Latin 
square 
Latin square, 12, 385 
incomplete, 1, 385 
missing values, 364 
Lattice, 
balanced, 215 
latinized rectangular, 215 
near-balance rectangular, 215 
rectangular, 215 
simple, 17, 169 
triple, 17 
Least significant difference, 512 
test, protected, 513 
Least squares, 57, 88, 202, 225, 245, 270, 
313, 334, 490, and see adjusted 
means, analysis of variance, esti- 
mation, fitting constants, matrix, 
normal equations, precision 
iterative computation, 238 
weighted, 437 
Life tables, 435 
Life testing, D67.1, 433, 497 
Likelihood, see maximum likelihood 
Likelihood ratio, 8, 54, 538 
Linkage, 98, 453, and see interference 
quasi-, 100 
Linked blocks, 198 
Logistic curve, 28, and see growth, logit 
transformation, population 


maximum likelihood, 
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Main effect, 463, and see analysis of 
variance, interaction 
Markov, 416 
Matching, see balancing, paired com- 
parisons 
Matrix, 52, 244, and see covariance ma- 
trix, determinantal equation, eigen 
value, inner product, latent root, 
multivariate analysis, quadratic 
forms 
algebra, 19, 416 
design, 421 
dispersion, 184, 420 
hypothesis, 421 
inverse, 364 
stochastic, 418, 419 
truncated, 420 
Maximum likelihood, 4, 28, 66, 245, 413, 
437, 461, 478, 551, and see efficiency 
estimation, information 
estimate of variance, 485 
Mean, see moments 
adjusted, 13, 266, and see least squares 
Medicine, see clinical, genetics, hema- 
tology, immunology, pharma- 
cology, physiology, virology 
Metameter, 35, and see scales 
Metric, 98, and see scales 
Microbiological assay, 35, 
bacteriology, bioassay 
Minimum chi square estimate, 109, 462 
Missing values, 57, 115, 140, 267, 310, 
363, 387, and see fitting constants, 
rejection of data 
computation of, 3 
variance of, 115 
Model, see biometry, causation, com- 
ponents of variance, hypothesis, 
regression, transformations 
analysis of variance model I, 498 
bioassay, 136, 150 
genetic, 103 
inappropriate, 501 
mathematical, 4, 261, 312, 334, 376, 
412, 422, 433, D66 
mixed, 348 
statistical, 115, 200, 475, 486, 498 
stochastic, 80 
Moments, see cumulants 
factorial, 69 


and see 


7 
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generating function, 81 
method of, 68, 491 
Monte Carlo methods, 547 
Mortality, 546, 548, 549 
Multinomial distribution, 451 
Multivariate analysis, 19, 177, 282, 298, 
412, 546, and see analysis of vari- 
ance, basis, determinantal equation, 
discriminant function, factor anal- 
ysis, generalized distance, matrix, 
principal component, vector 
confidence limits, 420 
Newman-Keuls test, 453, and see multi- 
ple range test 
Noncentrality, 499 
Nonparametric tests, 410, 413, 415 
Normal equations, 24, 334, and see least 
squares, matrix 
Nuisance parameter, 262 
Nutrition, 299, 373, 514, and see food 
technology 
Observational studies, 264 
Opinion research, 468 
Optional stopping, 421 
Order statistics, 417 
Organolepsis, 9, 51, 113, 496, and see 
scores 
color, 504 
flavor, 55 
Orthogonality, 456, and see comparisons 
nonorthogonality, 238, 456 
Orthogonal polynomials, 36 
Pairing, 264, and see balancing, paired 
comparisons, stratification 
Palatability, see organolepsis 
Percentages, see proportions 
Perception, see organolepsis 
Pharmacology, 149, 245, 406, and see 
analgesia, bioassay 
Physical science, 294, 309, 323, and see 
industrial research 
Physiology, 183, and see endocrinology, 
medicine 
Placebo, 406 
Plant breeding, 442, 545, 549 
Poisson distribution, 65, 150, 407, 459, 
485 
compound, 65 
Polya distribution, 66, 548 
Pooling, 303 
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Population, see distribution, statistical 
genetics 
changes, 65, 79 
density, effects of, 186 
Management, see ecology 
Poultry, 449 
Precision, 149, 262, and see information, 
least squares 
Prediction, see regression 
Preference, see organolepsis 
Probability, 80, 127, 452, 486 
conditional, 461 
generating function, 67, 81 
of disease, 547 
Proportions, 83, 513, and see binomial 
significance of, 407 
Psychology, 51, 242, 545, and see psy- 
chological tests 
Quadratic forms, 88, 275, 537 
Quality, see organolepsis 
Quantification, see scales 
Queue, 413, and see stochastic processes 
Randomization, 265, 494, and see selec- 
tion 
lacking, 310, 374, and see observa- 
tional studies 
Randomized blocks, 373, and _ see 
balancing, incomplete blocks 
Random process, 416, and see stochastic 
processes 
Random variable, 84, 117, 433 
Range, see tests using range 
quasi-, 412 
Ranks, 415, and see scores, transforma- 
tions 
Ratio, see proportion 
Recurrence formula, 68 
Regression, see adjusted means, canoni- 
cal analysis, correlation, covariance, 
orthogonal polynomials, trend 
adjustment, 311 
analysis, see analysis of covariance 
binomial, 225 
coefficient, 269 
variance of, 298 
curvilinear, 230 
equation, 508 
error, 295, 401 
external, 295 
heterogeneity of, 354 
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independent variable affected by treat- 
ments, 264, 287 
interblock, 279, 316 
interpretation of, 282 
model, 261, 304, 501 
multiple, 19, 178, 501 
of proportion, 225 
of ratio, 225 
repeated linear, 295 
treatment, 295 
Rejection of data, 117, and see missing 
values, selection test for, 118 
Reliability, 409 
Replication, fractional, 391, 411, and see 
alias 
Residuals, see error 
Response, see bioassay, dose response, 
model, time response quantal, 131, 
149, 415 
scale, 51 
Reversal, see cross-over design 
Sample size needed, 408, 442, 530, and 
see optional stopping 
Sampling, 71, 127, 248, 246, and see 
components of variance, design of 
experiments, sample size needed 
area, 245 
ecological, 71 
error, see variance 
Scales, 51, 409, 496, and see metric, 
scores 
Scedasticity, 164 
Scores, 55, 177, 496, and see discrimi- 
nant function, organolepsis, ranks, 
scales 
Selection, see balancing, design of ex- 
periments, genetic selection, ran- 
domization, rejection of data, 
sampling 
of data, 133 
of estimate, 438 
of subsets, 414 
Sensitivity data, 496, and see quantal 
response 
Sensory tests, see organolepsis 
Serology, see hematology, immunology 
Significance, 511, and see selection 
Sign test, 410 
Simultaneous equations, see matrix 
nonlinear, 54 
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Skewness, 71 
Sociology, 242, 270, 468 
Split plots, 396, 506 
Covariance adjustment, 268, 279, 
396 
Standard deviation, see variance 
Standard error, see variance 
Statistical control, see analysis of co- 
variance 
Stochastic processes, 79, 418, 419, 420, 
451, and see queue, random vari- 
ables 
Stratification, 373, and see sampling 
Student’s t, see t test 
Subjective evaluation, see organolepsis 
Successive approximation, see iteration 
Sufficient statistic, 28, 411, 414, 421, and 
see efficiency, estimation 
Survival curve, see dose response curve, 
time response curve 
Survival time, see time response curve 
Switchback, see cross-over design 
Symmetry, 410 
Systematic designs, 36, 374 
Systematic statistics, see order statistics 
Tables, miscellaneous, 28, 54, 366, 443, 
502, 515 
Taste tests, see organolepsis 
Taxonomy, 183, 238, 408 
Tests, see analysis of variance, Behrens- 
Fisher, chi square, confidence limits, 
F test, goodness of fit, least signifi- 
cant difference, life testing, null 
hypothesis, ranks, rejection of data, 
significance, t test 
accelerated, 497 
approximate, 510 
combination of, 318 
exact, 113, 370, 387 
fixed range, 511 
Multiple range, 13, 164, 344, 511 
Duncan’s, 13, 513 
Tukey’s, 13, 513 
power of, 174 
multivariate, 419 
of significance, 57, 273, 511 
of significance of 
difference, approximate, 407 
multiple correlation, 498 
regression, 285 
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variance, 264 
variance ratio, noncentral, 497 
one-sided, 502 
psychological, 409 
studentized range, 512 
two-tailed, 503 
using range, 413, and see fixed range 
test, multiple range test, studentized 
range test 
variances heterogeneous, 12 
Theory, see biometry, hypothesis, model 
Time response curve, 433, 546, and see 
dose response curve 
Time series, 19, 241, 245 
Topology, discriminatory, 185, and see 
discriminant function 
Transformations, 39, 64, 183, 415, and 
see additivity, analysis of variance, 
bioassay, canonical analysis, dis- 
criminant function, logistic curve, 
matrix, model 
angular, 407 
are sine, see angular 
inverse sine, see angular 
linear, 416 
logit, 29 
probit, 30, 157 
rankit, 53 
Trend, 415, and see regression 
T test, see confidence limits 
multiple, 513 
noncentral, 535 
Twins, 222 
Unequal subclasses, 333 
Uniformity data, 66 
Variance, see covariance, F test, genetics, 
mean deviation, mean square error 
analysis of, 5, 20, 36, 55, 139, 200, 261, 
333, 413, 454, 505, 514, 537, 541, 
and see additivity, analysis of co- 
variance, chi square, components of 
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variance, degrees of freedom, error, 
fitting constants, F test, interaction, 
least significant difference, least 
squares, main effect, missing values, 
model, multiple F test, multi- 
variate analysis, orthogonal poly- 
nomials, pooling, regression, tests, 
transformations, uniformity data 
model I, 498 
power of, 506 
approximate, 8 
components, 225, 282, 550 
computation of, 344 
effective, 377 
error, 262 
heterogeneous, 13 
of adjusted mean, 6, 15, 117, 269 
of difference, 6 
of difference of adjusted means, 368, 
377 
of estimate, 438 
of estimate of potency, 135, 143 
of regression coefficient, 117 
ratio, noncentral, 497 
related to mean, 485 
Vector, 451 
Viability, 451 
Virology, 11, 149 
Vitamins, 
A, 299 
B12, 35 
riboflavin, 131 
Weighted squares of means, method of, 
333 
Weighting, 29, 317, 349, 438 
Wild-life, 548 
Youden squares, see incomplete Latin 
square 
Zero-crossing, 416 
Z test, see F test 


JOURNAL OF THE 


AMERICAN STATISTICAL 
ASSOCIATION 


DECEMBER 1957 


Seasonal Adjustments by Electronic Computer Methods 
Julius Shiskin and Harry Eisenpress 
Problems in Measuring Long Term Growth in Income and Wealth 
Alexander Gerschenkron 
The Rational Origin for Measuring Subjective Values 
L. L. Thurstone and Lyle V. Jones 
Applications of a New Graphic Method in Statistical Measurement 
Jacob Mincer 
Graphic Computation of Ri.23. . . Frederick V. Waugh and Karl A. Fox 
Confidence Intervals for the Product of Two Binomial Parameters 
Robert J. Buehler 
Optimum Sampling in Binomial Populations. ...... Paul N. Somerville 
Estimates of Sampling Variance Where Two Units Are Selected from 


Estimation of Parameters from Incomplete Multivariate Samples 
George E. Nicholson, Jr. 
Truncation to Meet Requirements on Means..... Frank Eugene Clark 
The Midrange of a Sample as an Estimator of the Population Midrange 
Paul R. Rider 


John Mandel 


BOOK REVIEWS by Kenneth O. Alexander, James W. Angell, Jorge 
Arias B., Gerhard Bry, K. L. Chung, Jerome B. Cohen, Miles L. 
Colean, James F. Digby, Acheson J. Duncan, W. Duane Evans, 
John I. Griffin, John A. Howard, Ray Hyman, Leonard Kent, 
Benjamin J. Klebaner, William Lerner, Philip J. McCarthy, Ingram 
Olkin, Ellis R. Ott, Thornton Page, Lloyd Saville, Theresa R. Sha- 
piro, Irving H. Siegel, Robert E. Weintraub, B. L. Welch, Ruth 


415 


1 


388 £88 &§8 


VOLUME 52 NUMBER 280 


Information on membership, subscriptions, and back numbers should be 
requested from the Executive Director, American Statistical Association, 
1757 K Street, N.W., Washington 6, D. C., U.S. A. 


2 
= 
450 
458 
472 
479 
482 

Each Stratum... Nathan Keyfitz 503 
Applications of Multivariate Polykays to the Theory of Unbiased Ratio- 

Use of Dummy Variables in Regression Equations. . .. . Daniel B. Suits 
Fitting a Straight Line to Certain Types of Cumulative Data 

Estimating the Logistic Curve. ......................H. Silverstone 
617 


BACK ISSUES 


Back issues of Biometrics are available at the following postage-paid 
prices in U.S.A. currency: 
; Price per Price per 
Volume Number _ Single Number Volume(unbound) 


1to6 
1 to 6 
1to4 
1to4 
1to4 
lto4 
1to4 
lto4 
1to4 
1to4 
lto4 
lto4 


$ 6.00 


Ssssssssss 


$ 1.00 
1.00 
1.50 
1.50 
1.50 
1.50 
2.00 
2.00 
2.00 
2.00 
2.00 
2.00 


Inquiries, non-member subscriptions, and orders for back issues should 
be addressed to: 


Biomerrics 

DEPARTMENT OF STATISTICS 
Potytecanic InstiTUTE 
U.S.A, 


q 

1945 a 
1946 
1947 
1948 
1949 
1950 q = 
1951 
1952 
1953 
1954 10 
1955 ll 
1956 12 


; 
: 
q 
a 
a 
4 
q 
_< 
= 
ne 
| 
7 
“4 


